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NOVEL HUMAN KINASES AND 
POLYNUCLEOTIDES ENCODING THE SAME 

The present application claims the benefit of U.S. 
Provisional Application Number 60/258,335 which was filed on 
5 December 27, 2000 and is herein incorporated by reference in 
its entirety. 

1 . INTRODUCTION - 
The present invention- relates to the discovery, 
identification, and characterization of novel human 

10 polynucleotides encoding proteins sharing sequence similarity 
with animal kinases. The invention encompasses the described 
polynucleotides, host cell expression systems, the encoded 
proteins, fusion proteins, polypeptides and peptides, 
antibodies to the encoded proteins and peptides, and 

15 genetically engineered animals that either lack or overexpress 
the disclosed genes, antagonists and agonists of the proteins, 
and other compounds that modulate the expression or activity 
of the proteins encoded by the disclosed genes, which can be 
. used for diagnosis, drug screening, clinical trial monitoring,. 

20 the treatment of diseases and disorders, and cosmetic or 
nutriceutical applications. 

2 . BACKGROUND OF THE INVENTION 
. Kinases mediate the phosphorylation of a wide variety of 
proteins and compounds in the cell. Along with phosphatases, 
25 kinases are involved in a range of regulatory pathways. Given 
the physiological importance of kinases, they have been 
subject to intense scrutiny and are proven drug targets. 

3. SUMMARY OF THE INVENTION 
The present invention relates to the discovery, 
30 identification, and characterization of nucleotides that 
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encode novel human proteins and the corresponding amino acid 
sequences of these proteins. The novel human proteins (NHPs) 
described for the first time herein share structural 
similarity with animal kinases, including, but not limited to, 
5 serine- threonine kinases, and particularly Citron rho- 

interacting kinases. The described sequences describe a full 
length version of previously reported proteins that were 
erroneously presumed to be full length. Accordingly, the 
described NHPs encode novel kinases having homologues and 
10 orthologs across a range of phyla and species. 

The novel human polynucleotides described herein, encode 
alternative open reading frames (ORFs) encoding proteins of 
2054 and 1958 amino acids in length (see respectively SEQ ID 
NOS : 2 and 4 ) . 

15 The invention also encompasses agonists and antagonists 

of the described NHPs, including small molecules, large 
molecules, mutant NHPs, or portions thereof, that compete with 
native NHP, peptides, and antibodies, as well as nucleotide 
sequences that can be used to inhibit the expression of the 

20 described NHPs (e.g., antisense and ribozyme molecules, and 
open reading frame or regulatory sequence replacement 
constructs) or to enhance the expression of the described NHPs 
(e.g., expression constructs that place the described 
polynucleotide under the control of a strong promoter system) , 

25 and transgenic animals that express a NHP sequence, or "knock- 
outs" (which can be conditional) that do not express a 
functional NHP. Knock-out mice can be produced in several 
ways, one of which involves the use of mouse embryonic stem 
cells ("ES cells") lines that contain gene trap mutations in a 

30 murine homolog of at least one of the described NHPs. When 
the unique NHP sequences described in SEQ ID NOS: 1-4 are 
"knocked-out" they provide a method of identifying phenotypic 
expression of the particular gene as well as a method of 



WO 02/059325 



PCT/US01/50497 



assigning function to previously unknown genes. In addition, 
animals in which the unique NHP sequences described in SEQ. ID 
NOS:l-4 are "knocked-out " provide a unique source in which to 
elicit antibodies to homologous and orthologous proteins which 
5 would. have been previously viewed by the immune system as 
"self" and therefore would have failed to elicit significant 
antibody responses. To these ends, gene trapped knockout ES 
cells have been generated in murine homologs of the described 
NHPs. 

10 Additionally, the unique NHP sequences described in SEQ 

ID N0S:l-4 are useful for the identification of protein coding 
sequence and mapping a unique gene to a particular chromosome. 
These sequences identify actual, biologically verified, and 
therefore relevant, exon splice junctions as opposed to those 

15 that may have been bioinf ormatically predicted from genomic 
sequence alone. The sequences of the present invention are 
also useful as additional DNA markers for restriction fragment 
length polymorphism (RFLP) analysis, and in forensic biology. 
Further, the present invention also relates to processes 

20 for identifying compounds that modulate, i.e., act as agonists 
or antagonists, of NHP expression and/or NHP activity that 
utilize purified preparations of the described NHPs and/ or NHP 
product, or. cells expressing the same. Such compounds can be 
used as therapeutic agents for the treatment of any of a wide 

25 variety of symptoms associated with biological disorders or 
imbalances . 

4. DESCRIPTION OF THE SEQUENCE LISTING AND FIGURES 
The Sequence Listing provides the sequence of the novel 
30. human ORFs encoding the described novel human kinase proteins. 
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5. DETAILED DESCRIPTION OF THE INVENTION 
The NHPs described for the first time herein are novel 
proteins whose expression can be detected in, inter alia, 
human cell lines and human testis, small intestine, fetal 
5 kidney, 6- and 9-week embryos, adenocarcinoma, osteosarcoma, 
and embryonic carcinoma cells. The described sequences were 
compiled from sequences available in GENBANK (AC016922) , and 
cDNAs generated from human fetal kidney, testis, and lymph 
node mRNAs (Edge Biosystems, Gaithersburg, MD) that were 

10 identified using primers generated from human genomic DNA. 
The present invention encompasses the nucleotides 
presented in the Sequence Listing, host cells expressing such 
nucleotides, the expression products of such nucleotides, and: 
(a) nucleotides that encode mammalian homologs of the 

15 described genes, including the specifically described NHPs, 
and the NHP products; (b) nucleotides that encode one or more 
portions of an NHP that correspond to functional domains, and 
the polypeptide products specified by such nucleotide 
sequences, including but not limited to the novel regions of 

20 any active domain (s) ; (c) isolated nucleotides that encode 
mutant versions, engineered or naturally occurring, of the 
described NHPs in which all or a part of at least one domain 
is deleted or altered, and the polypeptide products specified 
by such nucleotide sequences, including but not limited to 

25 soluble proteins and peptides in which all or a portion of the 
signal sequence is deleted; (d) nucleotides that encode 
chimeric fusion proteins containing all or a portion of a 
coding region of a NHP, or one of its domains (e.g., a 
receptor/ ligand binding domain, accessory protein/self - 

30 association domain, etc. ) fused to- another peptide or 

polypeptide; or (e) therapeutic or diagnostic derivatives of 
the described polynucleotides such as oligonucleotides, 

4 
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antisense polynucleotides, ribozymes, dsRNA, or gene therapy 
constructs comprising a sequence first disclosed in 
the Sequence Listing. As discussed above, the present 
invention includes: (a) the human DNA sequences presented in 
5 the Sequence Listing (and vectors comprising the same) and 
additionally contemplates any nucleotide sequence encoding a 
contiguous NHP open reading frame (ORF) that hybridizes to a 
complement of a DNA sequence presented in the Sequence Listing 
under highly stringent conditions, e.g., hybridization to 

10 filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate 
(SDS) , 1 mM EDTA at 65°C, and washing in 0.1xSSC/0.1% SDS at 
68°C (Ausubel F.M. et al . , eds., 1989, Current Protocols in 
Molecular Biology, Vol. I, Green Publishing Associates, Inc., 
and John Wiley & Sons, Inc., NY, at p. 2.10.3) and encodes a 

15 functionally equivalent expression product. Additionally, 
contemplated are any nucleotide sequences that hybridize to 
the complement of the DNA sequence that encode and express an 
amino acid sequence presented in the Sequence Listing under 
moderately stringent conditions, e.g.] washing in 0.2xSSC/0.1% 

20 SDS at 42°C (Ausubel et al . , 1989, supra), yet still encode a 
functionally equivalent NHP product. Functional equivalents 
of a NHP include naturally occurring NHPs present in other 
species and mutant NHPs whether naturally occurring or 
engineered (by site directed mutagenesis, gene shuffling, 

25 directed evolution as described in, for example, U.S. Patent 
Nos. 5,837,458 or 5,723,323 both of which are herein 
incorporated by reference) . The invention also, includes 
degenerate nucleic acid variants of the disclosed NHP 
polynucleotide sequences. 

30 Additionally contemplated are polynucleotides encoding 

NHP ORF s, or their functional equivalents, encoded by 
polynucleotide sequences that are about 99, 95, 90, or about 

5 
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85 percent similar to corresponding regions of SEQ ID NO:l (as 
measured by BLAST sequence comparison analysis using, for 
example, the GCG sequence analysis package using default 
parameters) . 

5 The invention also includes nucleic acid molecules, 

preferably DNA molecules, that hybridize to, and are therefore 
- the complements of, the described NHP encoding 
polynucleotides. Such hybridization conditions can be highly 
stringent or less highly stringent, as described above. In 

10 instances where the nucleic acid molecules are 

deoxyoligonucleotides ("DNA oligos"), such molecules are 
generally about 16. to about 100 bases long, or about 20 to 
about 80, or about 34 to about 45 bases long, or any variation 
or combination of sizes represented therein that incorporate a 

15 contiguous region of sequence first disclosed in the Sequence 
Listing. Such oligonucleotides can be used in conjunction 
with the polymerase chain reaction (PCR) to screen libraries, 
isolate clones, and prepare cloning and sequencing templates, 
etc. 

20 Alternatively, such NHP oligonucleotides can be used as 

hybridization probes for screening libraries, and assessing 
gene expression patterns (particularly using a micro array or 
high- throughput "chip" format) . . Additionally, a series of the 
described NHP oligonucleotide sequences, or the complements 

25 thereof, can be used to represent all or a portion of the 

described NHP sequences. An oligonucleotide or polynucleotide 
sequence first disclosed in at least a portion of one or more 
of the sequences of SEQ ID NOS: 1-4 can be used as a 
hybridization probe in conjunction with a solid support 

30 matrix/substrate (resins, beads, membranes, plastics, 

polymers, metal or metallized substrates, crystalline or 
polycrystalline substrates, etc.). Of particular note are 

6 
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spatially addressable arrays (i.e., gene chips, microtiter 
plates, etc.) of oligonucleotides and polynucleotides, or 
corresponding oligopeptides and polypeptides, wherein at least 
one of the biopolymers present on the spatially addressable 
5 array comprises an oligonucleotide or polynucleotide sequence 
first disclosed in at least one of the sequences of SEQ ID 
NOS: 1-4, or an amino acid sequence encoded thereby. Methods 
for attaching biopolymers to, or synthesizing biopolymers on, 
solid support matrices, and conducting binding studies thereon 

10 are disclosed in, inter alia, U.S. Patent Nos. 5,700,637, 
5,556,752, 5,744,305, 4,631,211, 5,445,934, 5,252,743, 
4,713,326, 5,424,186, and 4,689,405 the disclosures of which 
. are herein incorporated by reference in their entirety. 

Addressable arrays comprising sequences first disclosed 

15 in SEQ ID NOS: 1-4 can be used to identify and characterize the 
temporal and tissue specific expression of a gene. These - 
addressable arrays incorporate oligonucleotide sequences of 
sufficient length to confer the required- specificity, yet be 
within the limitations of the production technology. The 

2 0 length of these probes is within a range of between about 8 to 
about 2 000 nucleotides. Preferably the probes consist of 60 
nucleotides and more preferably 25 nucleotides from the 
sequences first disclosed in SEQ ID NOS: 1-4. 

For example, a series of the described oligonucleotide 

25 sequences, or the complements thereof, can be used in chip 
format to represent all or a portion of the described 
sequences. The oligonucleotides, typically between about 16 
to about 40 (or any whole number within the stated range) 
nucleotides in length can partially overlap each other and/or 

30 the sequence may be represented using oligonucleotides that do 
not overlap. Accordingly, the described polynucleotide 
sequences shall typically comprise at least about two or three 

7 
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distinct oligonucleotide sequences of at least about 8 
nucleotides in length that are each first disclosed in the 
described Sequence Listing. Such oligonucleotide sequences 
can begin at any nucleotide present within a sequence in the 
5 Sequence Listing and proceed in either a sense (5'-to-3\) 
orientation vis-a-vis the described sequence or in an 
antisense orientation. 

Microarray-based analysis allows the discovery of broad 
patterns of genetic activity, providing new understanding of 

10 gene functions and generating novel and unexpected insight 

into transcriptional processes and biological mechanisms. The 
use of addressable arrays comprising sequences first disclosed 
in SEQ ID N0S:l-4 provides detailed information about 
transcriptional changes involved in a specific pathway, 

15 potentially leading to the identification of novel components 
or gene functions that manifest themselves as novel 
pheno types 

Probes consisting of sequences first disclosed in SEQ ID 
NOS:l-4 can also be used in the identification, selection and 

20 validation of novel molecular targets for drug discovery. The 
use of these unique sequences permits the direct confirmation 
of drug targets and recognition of drug dependent changes in 
gene expression that are modulated through pathways distinct 
from the drugs intended target. These unique sequences 

25 therefore also have utility in defining and monitoring both 
drug action and toxicity. 

As an example of utility, the sequences first disclosed 
in SEQ ID N0S:l-4 can be utilized in microarrays or other 
assay formats, to screen collections of genetic material from 

30 patients who have a particular medical condition. These 

investigations can also be carried out using the sequences \ 
first disclosed in SEQ ID N0S:l-4 in silico and by comparing 

8 
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previously collected genetic databases and the disclosed 
sequences using computer software known to those in the art. 

Thus the sequences first disclosed in SEQ ID NOS:l-4 can 
be used to identify mutations associated with a particular 
5 disease and also as a diagnostic or prognostic assay. 

Although the presently described sequences have been 
specifically described using nucleotide sequence, it should be 
appreciated that each of the sequences can uniquely be 
described using any of a wide variety of additional structural 

10 attributes, or combinations thereof. For example, a given 
sequence can be described by the net composition of the 
nucleotides present within a given region of the sequence in 
conjunction with the presence of one or more specific 
oligonucleotide sequence (s) first disclosed in the SEQ ID NOS: 

15 1-4. Alternatively, a restriction map specifying the relative 
positions of restriction endonuclease digestion sites, or 
various palindromic or other specific oligonucleotide 
sequences can be used to structurally describe a given 
sequence. Such restriction maps, which are typically 

20 generated by widely available computer programs (e.g., the 
University of Wisconsin GCG sequence analysis package, 
SEQUENCHER 3.0, Gene Codes Corp., Ann Arbor, MI, etc.), can 
optionally be used in conjunction with one or more discrete 
nucleotide sequence (s) present in the sequence that can be 

25 described by the relative position of the sequence relative to 
one or more additional sequence (s) or one or more restriction 
sites present in the disclosed sequence. 

For oligonucleotide probes, highly stringent conditions 
may refer, e.g., to washing in 6x SSC/0.05% sodium 

30 pyrophosphate at 37°C (for 14-base oligos) , 48°C (for 17-base 
oligos) , 55°C (for 20-base oligos), and 60°C (for 23-base 
oligos) . These nucleic acid molecules may encode or act as 

9 
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NHP gene antisense molecules, useful, for example, in NHP gene 
regulation and/or as antisense primers in amplification 
reactions of NHP gene nucleic acid sequences. With respect to 
NHP gene regulation, such techniques can be used to regulate 
5 biological functions. Further, such sequences can be used as 
part of ribozyme and/or triple helix sequences that are also 
useful for NHP gene regulation. 

Inhibitory antisense or double stranded oligonucleotides 
can additionally comprise at least one modified base moiety 
10 which is selected from the group including but not limited to 
5- f luorouraci 1 , 5 -bromouracil , 5 -chlorouraci 1 , 5 -iodouraci 1 , 
hypoxanthine, xanthine, 4-acetylcytosine, 

5- (carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 

2 - thiouridine , 5 -carboxymethylaminomethyluracil , 
15 dihydrouracil, beta-D-galactosylqueosine, inosine, 

N6 - i sopent eny ladenine , 1 -me thy lguanine , 1 -methyl inos ine , 
2 , 2 -dime thy lguanine, 2-methyladenine, 2-methylguanine, . 

3- methylcytosine, 5-methylcytosine, N6-adenine, 
7 -me thy lguanine , 5 -me thy laminome thy lur ac i 1 , 

20 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 
5'-methoxycarboxymethyluracil, 5-methoxyuracil , 
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid 
(v) , wybutoxosine, pseudouracil , queosine, 2-thiocytosine, 
5 -me thy 1 - 2 - thi ourac i 1 , 2 - thi ourac i 1 , 4 - thi ourac i 1 , 

25 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil- 
5-oxyacetic acid (v) , 5-methyl-2-thiouracil, 3- (3-amino-3-N-2- 
carboxypropyl ) uracil, (acp3)w, and 2 , 6-diaminopurine . 

The antisense oligonucleotide can also comprise at least 
one modified sugar moiety selected from the group including 

30 but not limited to arabinose, 2-f luoroarabinose, xylulose, and 
hexose . 

In yet another embodiment, the antisense oligonucleotide 
will comprise at least one modified phosphate backbone 

10 
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selected from the group including, but not limited to, a 
phosphorothioate, a phosphorodithioate, a 

phosphoramidothioate, a phosphoramidate, a phosphordiamidate, 
a methylphosphonate, an alkyl phosphotriester , and a 
5 formacetal or analog thereof. 

In yet another embodiment, the antisense oligonucleotide 
is an a-anomeric oligonucleotide. An a-ahomeric 
oligonucleotide forms specific double-stranded hybrids with 
complementary RNA in which, contrary to the usual 3-units, the 

10 strands run parallel to each other (Gautier et al . , 1987, 

Nucl. Acids Res. 15:6625-6641). The oligonucleotide is a 2'-0- 
methylribonucleotide (Inoue et al . , 1987, Nucl. Acids Res. 
15:6131-6148), or a chimeric RNA-DNA analogue (Inoue et al . , 
1987, FEBS Lett. 215:327-330). Alternatively, double stranded 

15 RNA can be used to disrupt the expression and function of a 
targeted NHP. 

Oligonucleotides of the invention can be synthesized by 
standard methods known in the art, e.g. by use of an automated 
DNA synthesizer (such as are commercially available from 

20 Biosearch, Applied Biosystems, etc.). As examples, 

phosphorothioate oligonucleotides can be synthesized by the 
method of Stein et al . (1988, Nucl. Acids Res. 15:3209), and 
methylphosphonate oligonucleotides can be prepared by use of 
controlled pore glass polymer supports (Sarin et al., 1988, 

25 Proc. Natl. Acad. Sci . U.S.A. 85:7448-7451), etc. 

Low stringency conditions are well-known to those of 
skill in the art, and will vary predictably depending on the 
specific organisms from which the library and the labeled 
sequences are derived. For guidance regarding such conditions 

30 see, for example, Sambrook et al . , 1989, Molecular Cloning/ A 
Laboratory Manual (and periodic updates thereof) , Cold Spring 
Harbor Press, NY; and Ausubel et al . , 1989, Current Protocols 
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in Molecular Biology, Green Publishing Associates and Wiley 
Interscience, NY. 

Alternatively, suitably labeled NHP nucleotide probes can 
be used to screen a human genomic library using appropriately 
5 stringent conditions or by PCR. The identification and 
characterization of human genomic clones is helpful for 
identifying polymorphisms (including, but not limited to, 
nucleotide repeats, microsatellite alleles, single nucleotide 
polymorphisms, or coding single nucleotide polymorphisms), 

10 determining the genomic structure of a given locus/allele, and 
designing diagnostic tests. For example, sequences derived from 
regions adjacent to the intron/exon boundaries of the human 
gene can be used to design primers for use in amplification 
assays to detect mutations within the exons, introns, splice 

15 sites (e.g., splice acceptor and/or donor sites), etc., that 
can be used in diagnostics and pharmacogenomics . 

For example, the present sequences can be used in 
restriction fragment length polymorphism (RFLP) analysis to 
identify specific individuals. In this technique, an 

20 individual's genomic DNA is digested with one or more 

restriction enzymes, and probed on a Southern blot to yield 
unique bands for identification (as generally described in 
U.S. Patent No. 5,272,057, incorporated herein by reference). 
In addition, the sequences of the present invention can be 

25 used to provide polynucleotide reagents, e.g., PCR primers, 
targeted to specific loci in the human genome, which can 
enhance the reliability of DNA-based forensic identifications 
by, for example, providing another "identification marker" 
(i.e., another DNA sequence that is unique to a particular 

3 0 individual) . Actual base sequence information can be used for 
identification as an accurate alternative to patterns formed 
by restriction enzyme generated fragments. 

12 
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Further,. a NHP gene homolog can be isolated from nucleic 
acid from an organism of interest by performing PCR using two 
degenerate or "wobble" oligonucleotide primer pools designed 
on the basis of amino acid sequences within the NHP products 
5 disclosed herein. The template for the reaction may be total 
RNA, mRNA, and/or cDNA obtained by reverse transcription of 
mRNA prepared from, for example, human or non-human cell lines 
or tissue known or suspected to express an allele of a NHP 
gene, 

10 The PCR product can be subcloned and sequenced to ensure 

that the amplified . sequences represent the sequence of the 
desired NHP gene. The PCR fragment can then be used to 
isolate a full length cDNA clone by a variety ^of methods. For 
example, the amplified fragment can be labeled and used to 

15 screen a cDNA library, such as a bacteriophage cDNA library. 
Alternatively, the labeled fragment can be used to isolate 
genomic clones via the screening of a genomic library. 

PCR technology can also be used to isolate full length 
cDNA sequences. For example, RNA can be isolated, following 

20 standard procedures, from an appropriate cellular or tissue 

source (i.e., one known, or suspected, to express a NHP gene). 
A reverse transcription (RT) reaction can be performed on the 
RNA using an oligonucleotide primer specific for the most 5' 
end of the amplified fragment for the priming of first strand 

25 synthesis. The resulting RNA/DNA hybrid may then be "tailed" 
using a standard terminal transferase reaction, the hybrid may 
be digested with RNase H, and second strand synthesis may then 
be primed with a complementary primer. Thus, - cDNA sequences 
upstream of the amplified fragment can be isolated. For a 

30 review of cloning strategies that can be used, see e.g., 
Sambrook et al . , 1989, supra. 

13 
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A cDNA encoding a mutant NHP sequence can be isolated, 
for example, by using PCR. In this case, the first cDNA 
strand may be synthesized by. hybridizing an oligo-dT 
oligonucleotide to mRNA isolated from tissue known or 
5 suspected to be expressed in an individual putatively carrying 
a mutant NHP allele, and by extending the new strand with 
reverse transcriptase. The second strand of the cDNA is then 
synthesized using an oligonucleotide that hybridizes 
specifically to the 5' end of the normal sequence. Using these 

10 two primers, the product is then amplified via PCR, optionally 
cloned into a suitable vector, and subjected to DNA sequence 
analysis through methods well-known to those of skill in the 
art. By comparing the DNA sequence of the mutant NHP allele 
to that of a corresponding normal NHP allele, the mutation (s) 

15 responsible for the loss or alteration of function of the 
mutant NHP gene product can be ascertained. 

Alternatively, a genomic library can be constructed using 
DNA obtained from an individual suspected of or known to carry 
a mutant NHP allele (e.g., a person manifesting a NHP- 

20 associated phenotype such as, for example, behavioral 

disorders, immune disorders, obesity, high blood pressure, 
etc.), or a cDNA library can be constructed using RNA from a 
tissue known, or suspected, to express a mutant NHP allele. A 
normal NHP gene, or any suitable fragment thereof, can then be 

25 labeled and used as a probe to identify the corresponding 

mutant NHP allele in such libraries. Clones containing mutant 
NHP sequences can then be purified and subjected to sequence 
analysis according to methods well-known to those skilled in 
the art . 

30 Additionally, an expression library can be constructed 

utilizing cDNA synthesized from, for example, RNA isolated 
from a tissue known, or suspected, to express a mutant NHP - 
allele in an individual suspected of or known to carry such a 

14 
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mutant allele. In this manner, gene products made by the 
putatively mutant tissue may be expressed and screened using 
standard antibody screening techniques in conjunction with 
antibodies raised against a normal NHP product, as described 
5 below. For screening techniques, see, for example, Harlow, E. 
and Lane, eds., 1988, "Antibodies: A Laboratory Manual", Cold 
Spring Harbor Press, Cold Spring Harbor, NY. 

Additionally, screening can be accomplished by screening 
with labeled NHP fusion proteins, such as, for example, 

10 alkaline phosphatase-NHP or NHP-alkaline phosphatase fusion 
proteins. In cases where a NHP mutation results in an 
expression product with altered function (e.g., as a result of 
a missense or a frameshift mutation) , polyclonal antibodies to 
NHP are likely to cross-react with a corresponding mutant NHP 

15 expression product. Library clones detected via their 

reaction with such labeled antibodies can be purified and 
subjected to sequence analysis according to methods well-known 
in the art . , 

An additional application of the described novel human 

2 0 polynucleotide sequences is their use in the molecular 

mutagenesis/evolution of proteins that are at least partially 
encoded by the described novel sequences using, for example, 
polynucleotide shuffling or related methodologies. Such 
approaches are described in U.S. Patent Nos. 5,830,721, 

25 5,837,458, 6,117,679, and 5,723,323 which are herein 
incorporated by reference in their entirety. 

The invention also encompasses (a) DNA vectors that 
contain any of the foregoing NHP coding sequences and/or their 
complements (i.e., antisense) ; (b) DNA expression vectors that 

30 contain any of the foregoing NHP coding sequences opera tively 
associated with a regulatory element that directs the 
expression of the coding sequences (for example, baculovirus 
as described in U.s: Patent No. 5,869,336 herein incorporated 
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by reference) ; (c) genetically engineered host cells that 
contain any of the foregoing NHP coding sequences operatively 
associated with a regulatory element that directs the 
expression of. the coding sequences in the host cell; and (d) 
5 genetically engineered host cells that express an endogenous 
NHP sequence under the control of an exogenously introduced 
regulatory element (i.e., gene activation). As used herein, 
regulatory elements include, but are not limited to, inducible 
and non-inducible promoters, enhancers, operators and other 

10 elements known to those skilled in the art that drive and 

regulate expression. Such regulatory elements include but are 
not limited to the cytomegalovirus (hCMV) immediate early 
gene, regulatable, viral elements (particularly retroviral LTR 
promoters), the early or late promoters of SV40 adenovirus, 

15 the lac system, the trp system, the TAC system, the TRC 
system, the major operator and promoter regions of phage 
lambda, the control regions of fd coat protein, the promoter 
for 3-phosphoglycerate kinase (PGK) , the promoters of acid 
phosphatase, and the promoters of the yeast a-mating factors. 

20 Where, as in the present instance, some of the described 

NHP peptides or polypeptides are thought to be cytoplasmic or 
nuclear proteins (although processed forms or fragments can be 
secreted or membrane associated) , expression systems can be 
engineered that produce soluble derivatives of a NHP 

25 (corresponding to a NHP extracellular and/or intracellular 
domains, or truncated polypeptides lacking one or more 
hydrophobic domains) and/or NHP fusion protein products 
(especially NHP-Ig fusion proteins, i.e., fusions of a NHP 
domain to an IgFc) , NHP antibodies, and anti-idiotypic 

3 0 antibodies (including Fab fragments) that can be used in 

therapeutic applications. Preferably, the above expression 
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systems are engineered to allow the desired peptide or 
polypeptide to be recovered from the culture media. 

The present invention also encompasses antibodies and 
anti-idiotypic antibodies (including Fab fragments), 
5 antagonists and agonists of a NHP, as well as compounds or 
nucleotide constructs that inhibit expression of a NHP 
sequence (transcription factor inhibitors, antisense and 
ribozyme molecules, or open reading frame sequence or 
regulatory sequence replacement constructs), or promote the 

10 expression of a NHP (e.g., expression constructs in which NHP 
coding sequences are operatively associated with expression 
control elements such as promoters, promoter /enhancers, etc.). 

The NHPs or NHP peptides, NHP fusion proteins, NHP 
nucleotide sequences, antibodies, antagonists and agonists can 

15 be useful for the detection of mutant NHPs or inappropriately 
expressed NHPs for the diagnosis of disease. The NHP proteins 
or peptides, NHP fusion proteins, NHP nucleotide sequences, 
host cell expression systems, antibodies, antagonists, 
agonists and genetically engineered cells and animals can be 

20 used for screening for drugs (or high throughput screening of 
combinatorial libraries) effective in the treatment of the 
symptomatic or phenotypic manifestations of perturbing the 
normal function of a NHP in the body. The use of engineered 
host cells and/or animals can offer an advantage in that such 

25 systems allow not only for the identification of compounds 

that bind to the endogenous receptor/ ligand of a NHP, but can 
also identify compounds that trigger NHP-mediated activities 
or pathways . 

Finally, the NHP products can be used as therapeutics. 
30 For example, soluble derivatives such as NHP peptides /domains 
corresponding to NHPs, NHP fusion protein products (especially 
NHP-Ig fusion proteins, i.e., fusions of a NHP, or a domain of 
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a NHP, tq an IgFc) , NHP antibodies and anti-idiotypic 
antibodies (including Fab fragments) , antagonists or agonists 
(including compounds that modulate or act on downstream 
targets in a NHP-mediated pathway) can be used to directly 
5 treat diseases or disorders. For instance, the administration 
of an effective amount of soluble NHP, or a NHP-IgFc fusion 
protein or an anti-idiotypic antibody (or its Fab) that mimics 
the NHP could activate or effectively antagonize the 
endogenous NHP or a protein interactive therewith. Nucleotide 

10 constructs encoding such NHP products can be used to 

genetically engineer host cells to express such products in 
vivo; these genetically engineered cells function as 
"bioreactors" in the body delivering a continuous supply of a 
NHP, a NHP peptide, or a NHP fusion protein to the body. 

15 Nucleotide constructs encoding functional NHPs, mutant NHPs, 
as well as antisense and ribozyme molecules can also be used 
in "gene therapy" approaches for the modulation of NHP 
expression. Thus, the invention also encompasses 
pharmaceutical formulations and methods for treating 

20 biological disorders. 

Various aspects of the invention are described in greater 
detail in the subsections below. 

5.1 THE NHP SEQUENCES 
25 The cDNA sequences and corresponding deduced amino acid 

sequences of the described NHPs are presented in the Sequence 
Listing. 

Expression analysis has provided evidence that the 
described NHPs can be expressed in a relatively narrow range 
30 ' of human tissues. In addition to serine- threonine kinases, 
the described NHPs also share significant similarity to a 
range of additional kinase families, including Citron kinases 
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from a variety of phyla and species (for example GENBANK 
AF086824 and U39904) . Several polymorphisms were detected in 
the described NHPs including a C/G polymorphism at the region 
represented by nucleotide position number 5218 of, for 
5 example, SEQ ID NO:l which can result in a leu or val being 
present at corresponding amino acid (aa) position 1740 of, for 
example, SEQ ID NO: 2, and a C/G polymorphism at the region 
represented by nucleotide position number 6065 of, for 
example, SEQ ID NO:l which can result in an ala or gly being 

10 present at corresponding amino acid position 2022 of, for 
example, SEQ ID NO: 2. 

The gene encoding the described NHPs is apparently 
encoded on human chromosome 12 . 

The described novel human polynucleotide sequences can be 

15 used, among other things, in the molecular 

mutagenesis/evolution of proteins that are at least 
partially encoded by the described novel sequences 
using, for example, polynucleotide shuffling or related 
methodologies. Such approaches are described in U.S. 

20 Patent Nos. 5,83 0,721 and 5,837,458 which are herein 
incorporated by reference in their entirety. 

NHP gene products can also be expressed in transgenic 
animals. Animals of any species, including, but not limited 
to, worms, mice, rats, rabbits, guinea pigs, pigs, micro-pigs, 

25 birds, goats, and non-human primates, e.g., baboons, monkeys, 
and chimpanzees may be used to generate NHP transgenic 
animals . 

Any technique known in the art may be used to introduce a . 
NHP transgene into animals to produce the founder lines of 
30 transgenic animals. Such techniques include, but are not 

limited to pronuclear microinjection (Hoppe, P.C. and Wagner, 
T.E., 1989, U.S. Patent No. 4,873,191); retrovirus-mediated 
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gene transfer into germ lines (Van der Putten et al., 1985, 
Proc. Natl. Acad. Sci., USA 82:6148-6152); gene targeting in 
embryonic stem cells (Thompson et al., 1989, Cell 56:313-321); 
electroporation of embryos (Lo, 1983, Mol Cell. Biol. 3:1803- 
5 1814); and sperm-mediated gene transfer (Lavitrano et al., 

1989, Cell 57:717-723); etc. For a review of such techniques, 
see Gordon, 1989, Transgenic Animals, Intl. Rev. Cytol. 
115:171-229, which is incorporated by reference herein in its 
entirety. 

10 The present invention provides for transgenic animals 

that carry the NHP transgene in all their cells, as well as 
animals which carry the transgene in some, but not all their 
cells, i.e., mosaic animals or somatic cell transgenic 
animals. The transgene may be integrated as a single 

15 transgene or in concatamers, e.g., head-to-head tandems or 
head- to-tail tandems. The transgene may also be selectively 
introduced into and activated in a particular cell-type by 
following, for example, the teaching of Lasko et al:, 1992, 
Proc. Natl. Acad. Sci. USA 89:6232-6236. The regulatory 

20 sequences required for such a cell-type specific activation 
will depend upon the particular cell- type of interest, and 
will be apparent to those of skill in the art. 

When it is desired that a NHP transgene be integrated 
into the chromosomal site of the endogenous NHP gene, gene 

25 targeting is preferred. Briefly, when such a technique is to 
be utilized, vectors containing some nucleotide sequences 
homologous to the endogenous NHP gene are designed for the 
purpose of integrating, via homologous recombination with 
chromosomal sequences, into and disrupting the function of the 

30 nucleotide sequence of the endogenous NHP gene (i.e., 
"knockout" animals) . 

20 



WO 02/059325 



PCTYUS01/50497 



The transgene can also be selectively introduced into a 
particular cell -type, thus inactivating the endogenous NHP 
gene in only that cell-type, by following, for example, the 
teaching of Gu et al . , 1994, Science, 255:103-106. The 
5 regulatory sequences required for such a cell-type specific 
inactivation will depend upon the particular cell-type of 
interest, and will be apparent to those of skill in the art. 

Once transgenic animals have been generated, the 
expression of the recombinant NHP gene may be assayed 

10 utilizing standard techniques. Initial screening may be 

accomplished by Southern blot analysis or PCR techniques to 
analyze animal tissues to assay whether integration of the 
transgene has taken place. The level of mRNA expression of 
the transgene in the tissues of the transgenic animals may 

15 also be assessed using techniques which include but are not 
limited to Northern blot analysis of tissue samples obtained 
from the animal, in situ hybridization analysis, and RT-PCR. 
Samples of NHP gene-expressing tissue, may also be evaluated 
immunocytochemically using antibodies specific for the NHP 

2 0 transgene product. 

The present invention provides for "knockin" animals. 
Knockin animals are those in which a gene that the animal does 
not naturally have in its genome, is inserted. For example, 
when a human gene is used to replace its murine ortholog in 

25 the mouse. Such knockin animals are useful for the in vivo 
study, testing and validation of, intra alia, human drug 
targets as well as for compounds that are directed at the 
same . 
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5.2- NHPS AND NHP POLYPEPTIDES 
NHP products, polypeptides, peptide fragments, mutated, 
truncated, or deleted forms of the NHPs, and/or NHP fusion 
proteins can be prepared for a variety of uses. These uses 
5 include but are not limited to the generation of antibodies, 
as reagents in diagnostic assays, the identification of other 
cellular gene products related to the NHP, as reagents in 
assays for screening for compounds that can be used as 
pharmaceutical reagents useful in the therapeutic treatment of 

10 mental, biological, or medical disorders and disease 
(including cancer) . 

The Sequence Listing discloses the amino acid sequences 
encoded by the described NHP-encoding polynucleotides. The 
NHPs display initiator methionines that are present in DNA 

15 sequence contexts consistent with eucaryotic translation 

initiation sites. The NHPs do not display consensus signal 
sequences which indicates that they may be cytoplasmic or 
possibly nuclear proteins, although they may also be secreted 
or membrane associated. 

20 The' NHP amino acid sequences of the invention include the 

amino acid sequences presented in the Sequence Listing as well 
as analogues and derivatives thereof. Further, corresponding 
NHP homologues from other species are encompassed by the 
invention. In fact, any NHP protein encoded by the NHP 

25 nucleotide sequences described above are within the scope of 
the invention, as are any novel polynucleotide sequences 
encoding all or any novel portion of an amino acid sequence 
presented in the Sequence Listing. The degenerate nature of 
the genetic code is well-known, and, accordingly, each amino 

30 acid presented in the Sequence Listing, is generically 

representative of the well-known nucleic acid "triplet" codon, 
or in many cases codons, that can encode the amino acid. As 
such, as contemplated herein, the amino acid sequences 
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presented in the Sequence Listing, when taken together with 

the genetic code (see, for example, Table 4-1 at page 109 of 

"Molecular Cell Biology", 1986, J. Darnell et al. eds . , 

Scientific American Books, New York, NY, herein incorporated 

5 by reference) are generically representative of all the 

various permutations and combinations of nucleic acid 

sequences that can encode such amino acid sequences . 

« 

The invention also encompasses proteins that are 
functionally equivalent to the NHPs encoded by the presently 

10 described nucleotide sequences as judged by any of a number of 
criteria, including, but not limited to, the ability to bind 
and modify a NHP substrate, or the ability to effect an 
identical or complementary downstream pathway, or a change in 
cellular metabolism (e.g. , proteolytic activity, ion flux, 

15 . tyrosine phosphorylation, etc.) . Such functionally equivalent 
NHP proteins include, but are not limited to, additions or 
substitutions of amino acid residues within the amino acid 
sequence encoded by the NHP nucleotide sequences described 
above, but which result in a silent change, thus producing a 

20 functionally equivalent expression product. Amino acid 
substitutions may be made on the basis of similarity in 
polarity, charge, solubility, hydrophobici ty, hydrophilicity, 
and/or the amphipathic nature of the residues involved. For 
example, nonpolar (hydrophobic) amino acids include alanine, 

25 leucine, isoleucine, valine, proline, phenylalanine, 

tryptophan, and methionine; polar neutral amino acids include 
glycine, serine, threonine, cysteine, tyrosine, asparagine, 
and glutamine; positively charged (basic) amino acids include 
arginine, lysine, and histidine; and negatively charged 

30 (acidic) amino acids include aspartic acid, and glutamic acid. 

A variety of host-expression Vector systems can be used 
to express the NHP nucleotide sequences of the invention. 
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Where the NHP peptide or polypeptide can exist, or has been 
engineered to exist, as a soluble or secreted molecule, the 
soluble NHP peptide or polypeptide can be recovered from the 
culture media. Such expression systems also encompass 
5 engineered host cells that express a NHP, or functional 

equivalent, in situ. Purification or enrichment of a NHP from 
such expression systems can be accomplished using appropriate 
detergents and lipid micelles and methods well-known to those 
skilled in the art. However, such engineered host cells 

10 themselves may be used in situations where it is important not 
only to retain the structural and functional characteristics 
of the NHP, but to assess biological activity, e.g., in 
certain drug screening assays. 

The expression systems that may be used for purposes of 

15 the invention include, but are not limited to, microorganisms 
such as bacteria (e.g., E. coli, B. subtilis) transformed with 
recombinant bacteriophage DNA, plasmid DNA or cosmid DNA 
expression vectors containing NHP nucleotide sequences; yeast 
(e.g., Saccharomyces, Pichia) transformed with recombinant 

20 yeast expression vectors containing NHP nucleotide sequences; 
insect cell systems infected with recombinant virus expression 
vectors (e.g., baculovirus) containing NHP nucleotide 
sequences; plant cell systems infected with recombinant virus 
expression vectors (e.g., cauliflower mosaic virus, CaMV; 

2 5 tobacco mosaic virus, TMV) or transformed with recombinant 
plasmid expression vectors (e.g., Ti plasmid) containing NHP 
nucleotide sequences; or mammalian cell systems (e.g., COS, 
CHO, BHK, 293, 3T3) harboring recombinant expression 
constructs containing NHP nucleotide sequences and promoters 

30 derived from the genome of mammalian cells (e.g., 

metallothionein promoter) or from mammalian viruses (e.g., the r 
adenovirus late promoter; the vaccinia virus 7 . 5K promoter). 
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In bacterial systems, a number of expression vectors may 
be advantageously selected depending upon the use intended for 
the NHP product being expressed. For example, when a large 
quantity of such a protein is to be produced for the 
5 generation of pharmaceutical compositions of or containing 
NHP, or for raising antibodies to a NHP, vectors that direct 
the expression of high levels of fusion protein products that 
are readily purified may be desirable. Such vectors include, 
but are not limited, to the E. coli expression vector pUR278 

10 (Ruther et al., 1983, EMBO J. 2:1791), in which a NHP coding 
sequence may be ligated individually into the vector in frame 
with the lacZ coding region so that a fusion protein is 
produced; pIN vectors (Inouye & Inouye, 1985, Nucleic Acids 
Res. 13:3101-3109; Van Heeke & Schuster, 1989, J. Biol. Chem. 

15 254:5503-5509); and the like. pGEX vectors may also be used 
to express foreign polypeptides as fusion proteins with gluta- 
thione S-transferase (GST). In general, such fusion proteins 
are soluble and can easily be purified from lysed cells by 
adsorption to glutathione-agarose beads followed by elution in 

20 the presence of free glutathione. The PGEX vectors are 

designed to include thrombin or factor Xa protease cleavage 
sites so that the cloned target expression product can be 
released from the GST moiety. 

In an insect system, Autographs californica nuclear 

25 polyhedrosis virus (AcNPV) is used as a vector to express 
• foreign polynucleotide sequences. The virus grows in 

Spodoptera frugiperda cells. A NHP coding sequence can be 
cloned individually into non-essential regions (for example 
the polyhedrin gene) of the virus and placed under control of 

30 an AcNPV promoter (for example the polyhedrin promoter) . 

Successful insertion of NHP coding sequence will result in 
inactivation of the polyhedrin gene and production of non- 
25 
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occluded recombinant virus (i.e., virus lacking the 
proteinaceous coat coded for by the polyhedrin gene) . These 
recombinant viruses are then used to infect Spodoptera 
frugiperda cells -in which the inserted sequence is expressed 
5 (e.g., see Smith et al . , 1983, J. Virol. 46: 584; Smith, U.S. 
Patent No. 4,215,051). 

In mammalian host cells, a number of viral-based 
expression systems may be utilized. In cases where an 
adenovirus is used as an expression vector, the NHP nucleotide 

10 siequence of interest may be ligated to an adenovirus 

transcription/ translation control complex, e.g., the late 
promoter and tripartite leader sequence. This chimeric 
sequence may then be inserted in the adenovirus genome by in 
vitro or in vivo recombination. Insertion in a non-essential 

15 region of the viral genome (e.g., region El or E3) will result 
in a recombinant virus that is viable and capable of 
expressing a NHP product in infected hosts (e.g., See Logan & 
Shenk, 1984, Proc. Natl. Acad. Sci . USA 81:3655-3659). 
Specific initiation signals may also be required for efficient 

20 translation of inserted NHP nucleotide sequences. These 
signals include the ATG initiation codon and adjacent 
sequences. i n cases where an entire NHP gene or cDNA, 
including its own initiation codon and adjacent sequences, is 
inserted into the appropriate expression vector, no additional 

25 translational control signals may be needed. However, in 
cases where only a portion of a NHP coding sequence is 
inserted, exogenous translational control signals, including, 
perhaps, the ATG initiation codon, must be provided. 
Furthermore, the initiation codon must be in phase with the 

30 reading, frame of the desired coding sequence to ensure 
translation of the entire insert. These exogenous 
translational control signals and initiation codons can be of 
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a variety of origins, both natural and synthetic. The 
efficiency of expression may be enhanced by the inclusion of 
appropriate transcription enhancer elements, transcription 
terminators, etc. (See Bitter et al., 1987, Methods in 
5 Enzymol. 153:516-544). 

In addition, a host cell strain may be chosen that 
modulates the expression of the inserted sequences, or 
modifies and processes the expression product in the specific 
fashion desired. Such modifications (e.g., glycosylation) and 

10 processing (e.g., cleavage) of protein products may be 

important for the function of the protein. Different host 
cells have characteristic and specific mechanisms for the 
post-translational processing and modification of proteins arid 
expression products. Appropriate cell lines or host systems 

15 can be chosen to ensure the correct modification and 

processing of the foreign protein expressed. To this end, 
eukaryotic host cells which possess the cellular machinery for 
proper processing of the primary transcript, glycosylation, 
and phosphorylation of the expression product may be used. 

20 Such mammalian host cells include, but are not limited to, 
CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3, WI38, and in 
particular, human cell lines. 

For long-term, high-yield production of recombinant 
proteins, stable expression is preferred. For example, cell 

25 lines that. stably express the NHP sequences described above 

can be engineered. Rather than using expression vectors which 
contain viral origins .of replication, host cells can be 
transformed with DNA controlled by appropriate expression 
control elements (e.g., promoter, enhancer sequences,. 

30 transcription terminators, polyadenylation sites, etc.), and a 
selectable marker. Following the introduction of the foreign 
DNA, engineered cells may be allowed to grow for 1-2 days in 
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an enriched media, and then are switched to a selective media. 
The selectable marker in the recombinant plasmid confers 
resistance . to the selection and allows cells to stably- 
integrate the plasmid into their chromosomes and grow to form 
5 foci which in turn can be cloned and expanded into cell lines. 
This method may advantageously be used to engineer cell lines 
which express the NHP product. Such engineered cell lines may 
be particularly useful in screening and evaluation of 
compounds that affect the endogenous activity of the NHP 
10 product. 

A number of selection systems may be used, including but 
not limited to the herpes simplex virus thymidine kinase 
(Wigler et al . , 1977 , Cell 22:223), hypoxanthine-guanine 
phosphoribosyltransferase (Szybalska and Szybalski, 1962, 

15 Proc. Natl. Acad. Sci. USA 48:2026), and adenine 

phosphoribosyltransferase (Lowy et al . , 1980, Cell 22:817) 
genes, which can be employed in tk", hgprt" or aprt" cells, 
respectively. Also, antimetabolite resistance can be used as 
the basis of selection for the following genes: dhfr, which 

20 confers resistance to methotrexate (Wigler et al . , 1980, Proc. 
Natl. Acad. Sci. USA 77:3567; O'Hare et al . , 1981, Proc. Natl. 
Acad. Sci. USA 78:1527); gpt, which confers resistance to 
mycophenolic acid (Mulligan and Berg, 1981, Proc. Natl. Acad. 
Sci. USA 78:2072); neo, which confers resistance to the 

25 aminoglycoside G-418 (Colberre-Garapin et al . , 1981, J. Mol. 
Biol. 250:1); and hygro, which confers resistance to 
hygromycin (Santerre et al . , 1984, Gene 30:147). 

Alternatively, any fusion protein can be readily purified 
by utilizing an antibody specific for the fusion protein being 

30 expressed. For example, a system described by Janknecht et 

al. allows for the ready purification of non-denatured fusion . 
proteins expressed in human cell lines (Janknecht, et al,, 
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1991, Proc. Natl. Acad. Sci . USA 88:8972-8976). In this 
system, the sequence of interest is subcloned into a vaccinia 
recombination plasmid such that the sequence's open reading 
frame is translationally fused to an amino- terminal tag 
5 consisting of six histidine residues. Extracts from cells 
infected with recombinant vaccinia virus are loaded onto 
Ni 2+ -nitriloacetic acid-agarose columns and histidine-tagged 
proteins are selectively eliited with imidazole-containing 
buffers. 

10 Also encompassed by the present invention are fusion 

proteins that direct the NHP to a target organ and/or 
facilitate transport across the membrane into the cytosol. 
Conjugation of NHPs to antibody molecules or their Fab 
fragments could be used to target cells bearing a particular 

15 epitope. Attaching the appropriate signal sequence to the NHP 
would also transport the NHP to the desired location within 
the cell. Alternatively targeting of NHP or its nucleic acid 
sequence might be achieved using liposome or lipid complex 
based delivery systems. Such technologies are described in 

20 n Liposomes:A Practical Approach", New, R.R.C., ed., Oxford 

University Press, New York and in U.S. Patent Nos. 4,594,595, 
5,459,127, 5,948,767 and 6,110,490 and their respective 
disclosures which are herein incorporated by reference in 
their entirety. Additionally embodied are novel protein 

25 constructs engineered in such a way that they facilitate 
transport of the NHP to the target site or desired organ, 
where they cross the cell membrane and/or the nucleus where 
the NHP can exert its functional activity. This goal may be 
achieved by coupling of the NHP to a cytokine or other ligand 

30 that provides targeting specificity, and/or to a protein 

transducing domain (see generally U.S. applications Ser. No. 
60/111,701 and 60/056,713, both of which are herein 
incorporated by reference, for examples of such transducing 

29 



WO 02/059325 



PCT/US01/50497 



sequences) to facilitate passage across cellular membranes and 
can optionally be engineered to include nuclear localization. 

5.3 ANTIBODIES TO NHP PRODUCTS 
5 Antibodies that specifically recognize one or more 

epitopes of a NHP, or epitopes of conserved variants of a NHP, 
or peptide fragments of a NHP are also encompassed by the 
invention. Such antibodies include but are not limited to 
polyclonal antibodies, monoclonal antibodies (mAbs) , humanized 

10 or chimeric antibodies, single chain antibodies, Fab 

fragments, F(ab') 2 fragments, fragments produced by a Fab 
expression library, anti-idiotypic (anti-Id) antibodies, and 
epi tope-binding fragments of any of the above. 

The antibodies of the invention can be used, for example, 

15 in the detection of NHP in a biological sample and may, 

therefore, be utilized as part of a diagnostic or prognostic 
technique whereby patients may be tested for abnormal amounts 
of NHP. Such antibodies may also be utilized in conjunction 
with, for example, compound screening schemes for the 

20 evaluation of the effect of test compounds on expression 

4 

and/or activity of a NHP expression product. Additionally, 
such antibodies can be used in conjunction gene therapy to, 
for example, evaluate the normal and/or engineered NHP- 
expressing cells prior to their introduction into the patient. 

25 Such antibodies may additionally be used as a method for the 
inhibition of abnormal NHP activity. Thus, such antibodies 
may, therefore, be utilized as part of treatment methods. 

For the production of antibodies, various host animals 
may be immunized by injection with the NHP, a NHP peptide 

30 (e.g., one corresponding to a functional domain of a NHP), 
truncated NHP polypeptides (NHP in which one or more domains 
have been deleted) , functional equivalents of the NHP or 
mutated variant of the NHP. Such host animals may include but 
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are not limited to pigs., rabbits, mice, goats, and rats, to 
name but a few. Various adjuvants may be used to increase the 
immunological response, depending on the host species, 
including, but not limited to, Freund's adjuvant (complete and 
5 incomplete) , mineral salts such as aluminum hydroxide or 

aluminum phosphate, chitosan, surface active substances such 
as lysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, and potentially useful human adjuvants such as BCG 
(bacille Calmette-Guerin) and Corynebacterium parvum. 

10 Alternatively, the immune response could be enhanced by 

combination and or coupling with molecules such as keyhole 
limpet hemocyanin, tetanus toxoid, diphtheria toxoid, 
ovalbumin, cholera toxin or fragments thereof. Polyclonal 
antibodies are heterogeneous populations of antibody molecules 

15 derived from the sera of the immunized animals. 

Monoclonal antibodies, which are homogeneous populations 
of antibodies- to a particular antigen, can be obtained by any 
technique which provides for the production of antibody 
molecules by continuous cell lines in culture. These include, 

2 0 but are not limited to, the hybridoma technique of Kohler and 
Milstein, (1975, Nature 255:495-497; and U.S. Patent No. 
4,376,110), the human B-cell hybridoma technique (Kosbor et 
al., 1983, Immunology Today 4:72; Cole et al . , 1983, Proc. 
Natl. Acad. Sci . USA 80:2026-203 0), and the EBV-hybridoma 

25 technique (Cole et al . , 1985, Monoclonal Antibodies And Cancer 
Therapy, Alan R. Liss, Inc., pp. 77-96). Such antibodies may 
be of any immunoglobulin class including IgG, IgM, IgE, IgA, 
IgD and any subclass thereof. The hybridoma producing the mAb 
of this invention may be cultivated in vitro or in vivo. 

30 Production of high titers of mAbs in vivo makes this the 
presently preferred method of production. 
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In addition, techniques developed for the production of 
"chimeric antibodies" (Morrison efc al . , 1984, Proc . Natl. 
Acad. Sci.USA, 82:6851-6855; Neuberger et al . , 1984, Nature, 
312:604-608; Takeda efc al . , 1985, Nature, 314:452-454) by 
5 splicing the genes from a mouse antibody molecule of 

appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity can 
be used. A chimeric antibody is a molecule in which different 
portions are derived from different animal species, such as 

10 those having a variable region derived from a murine mAb and a 
human immunoglobulin constant region. Such technologies are 
described in U.S. Patent Nos. 6,075,181 and 5,877,397 and 
their respective disclosures which are herein incorporated by 
reference in their entirety. Also encompassed by the present 

15 invention is the use of fully humanized monoclonal antibodies 
as described in US Patent No. 6,150,584 and respective 
disclosures which are herein incorporated by reference in 
their entirety. 

Alternatively, techniques described for the production 

20 of single chain antibodies (U.S. Patent 4,946,778; Bird, 1988, 
Science 242:423-426; Huston et al . , 1988," Proc. Natl. Acad. 
Sci. USA 85:5879-5883; and Ward efc al . , 1989, Nature 341:544- 
546) can be adapted to produce single chain antibodies against 
NHP expression products. Single chain antibodies are formed 

25 by linking the heavy and light chain fragments of the Fv 

region via an amino acid bridge, resulting in a single chain 
polypeptide. 

Antibody fragments which recognize specific epitopes may 
be generated by known techniques. For example, such fragments 
30 include, but are not limited to: the F(ab') 2 fragments which 
can be produced by pepsin digestion of the antibody molecule 
and the Fab fragments which can be generated by reducing the 
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disulfide bridges of the F(ab , ) 2 fragments. Alternatively, 
Fab expression libraries may be constructed (Huse et al . , 
1989, Science, 246:1275-1281) to allow rapid and easy 
identification of monoclonal Fab fragments with the desired 
5 specificity. 

Antibodies to a NHP can, in turn, be utilized to generate 
anti-idiotype antibodies that "mimic" a given NHP, using 
techniques well-known to those skilled in the art. (See, 
e.g., Greenspan & Bona, 1993, FASEB J 7 (5) : 437-444 ; and 

10 Nissinoff, 1991, J. Immunol. 147 (8) : 2429-2438) . For example 
antibodies which bind to a NHP domain and competitively 
inhibit the binding of NHP to its cognate receptor/ligand can 
be used to generate anti-idiotypes that "mimic" the NHP and, 
therefore, bind, activate; or neutralize a NHP, NHP receptor, 

15 or NHP ligand. Such anti-idiotypic antibodies or Fab 

fragments of such ant i-idio types can be used in therapeutic 
regimens involving a NHP-mediated pathway. 

Additionally given the high degree of relatedness of 
mammalian NHPs, the presently described knock-out mice (having 

20 never seen NHP, and thus never been tolerized to NHP) have a 
unique utility, as they can be advantageously applied to the 
generation of antibodies against the disclosed mammalian NHP 
(i.e., NHP will be immunogenic in NHP knock-out animals) . 

The present invention is not to be limited in scope by 

25 the specific embodiments described herein, which are intended 
as single illustrations of individual aspects of the 
invention, and functionally equivalent methods and components 
are within the scope of the invention. Indeed, various 
modifications of the invention, in addition to those shown and 

30 described herein will become apparent to those skilled in the 
art from the foregoing description. Such modifications are 
intended to fall within the scope of the appended claims. All 
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cited publications, patents, and patent applications are 
herein incorporated by reference in their entirety. 
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WHAT IS CLAIMED IS : 

1. An isolated nucleic acid molecule comprising 
at least 24 contiguous bases of nucleotide sequence first 
disclosed in SEQ ID NO:l. 

2. An isolated nucleic acid molecule comprising a 
nucleotide sequence that: 



3. An isolated nucleic acid molecule comprising a 
nucleotide sequence encoding the amino acid sequence shown in 
SEQ ID NO: 2. 

4. An isolated nucleic acid molecule comprising a 
nucleotide sequence encoding the amino acid sequence shown in 
SEQ ID NO: 4. 



(a) 



encodes the amino acid sequence shown in SEQ 
ID NO: 2; and 

hybridizes under stringent conditions to the 
nucleotide sequence of SEQ ID NO:l or the 
complement thereof . 



(b) 
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SEQUENCE LISTING 
<110> LEXICON GENETICS INCORPORATED 

<120> Novel Human Kinases and Polynucleotides Encoding the Same 

<130> LEX-0289-;PCT 

<150> US 60/258,335 
<151> 2000-12-27 

<160> 4 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 6165 

<212> DNA 

<213> homo sapiens 

<400> 1 

atgttgaagt tcaaatatgg agcgcggaat cctttggatg ctggtgctgc tgaacccatt 60 

gccagccggg cctccaggct gaatctgttc ttccagggga aaccaccctt tatgactcaa 120 

cagcagatgt ctcctctttc ccgagaaggg atattagatg ccctctttgt tctctttgaa 180 

gaatgcagtc agcctgctct gatgaagatt aagcacgtga gcaactttgt ccggaagtat 240 

tccgacacca tagctgagtt acaggagctc cagccttcgg caaaggactt cgaagtcaga 300 

agtcttgtag gttgtggtca ctttgctgaa gtgcaggtgg taagagagaa agcaaccggg 3 60 

gacatctatg ctatgaaagt gatgaagaag aaggctttat tggcccagga gcaggtttca 420 

ttttttgagg aagagcggaa catattatct cgaagcacaa gcccgtggat cccccaatta 480 

cagtatgcct ttcaggacaa aaatcacctt tatctggtca tggaatatca gcctggaggg 540 

gacttgctgt cacttttgaa tagatatgag gaccagttag atgaaaacct gatacagttt 600 

tacctagctg agctgatttt ggctgttcac agcgttcatc tgatgggata cgtgcatcga 660 

gacatcaagc ctgagaacat tctcgttgac cgcacaggac acatcaagct ggtggatttt 720 

ggatctgccg cgaaaatgaa ttcaaacaag atggtgaatg ccaaactccc gattgggacc 780 

ccagattaca tggctcctga agtgctgact gtgatgaacg gggatggaaa aggcacctac - 840 

ggcctggact gtgactggtg gtcagtgggc gtgattgcct atgagatgat ttatgggaga 900 

tcccccttcg cagagggaac ctctgccaga accttcaata acattatgaa tttccagcgg 960 

tttttgaaat ttccagatga ccccaaagtg agcagtgact ttcttgatct gattcaaagc 1020 

ttgttgtgcg gccagaaaga gagactgaag tttgaaggtc tttgctgcca tcctttcttc 1080 

tctaaaattg actggaacaa cattcgtaac tctcctcccc ccttcgttcc caccctcaag 1140 

tctgacgatg acacctccaa ttttgatgaa ccagagaaga attcgtgggt ttcatcctct 1200 

ccgtgccagc tgagcccctc aggcttctcg ggtgaagaac tgccgtttgt ggggttttcg 1260 

tacagcaagg cactggggat tcttggtaga tctgagtctg ttgtgtcggg tctggactcc 1320 

cctgccaaga ctagctccat ggaaaagaaa cttctcatca aaagcaaaga gctacaagac 1380 

tctcaggaca agtgtcacaa gatggagcag gaaatgaccc ggttacatcg gagagtgtca 1440 

gaggtggagg ctgtgcttag tcagaaggag gtggagctga aggcctctga gactcagaga 1500 

tccctcctgg agcaggacct tgctacctac atcacagaat gcagtagctt aaagcgaagt 1560 

ttggagcaag cacggatgga ggtgtcccag gaggatgaca aagcactgca gcttctccat 1620 

gatatcagag agcagagccg gaagctccaa gaaatcaaag agcaggagta ccaggctcaa 1680 

gtggaagaaa tgaggttgat gatgaatcag ttggaagagg atcttgtctc agcaagaaga 1740 

cggagtgatc tctacgaatc tgagctgaga gagtctcggc ttgctgctga agaattcaag 1800 

cggaaagcga cagaatgtca gcataaactg ttgaaggcta aggatcaagg gaagcctgaa 1860 

gtgggagaat atgcgaaact ggagaagatc aatgctgagc agcagctcaa aattcaggag 1920 

ctccaagaga aactggagaa ggctgtaaaa gccagcacgg aggccaccga gctgctgcag 1980 

aatatccgcc aggcaaagga gcgagccgag agggagctgg agaagctgca gaaccgagag 2040 
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gattcttctg aaggcatcag aaagaagctg gtggaagctg aggaacgccg ccattctctg 2100 

gagaacaagg taaagagact agagaccatg gagcgtagag aaaacagact gaaggatgac 2160 

atccagacaa aatcccaaca gatccagcag atggctgata aaattctgga gctcgaagag 2220 

aaacatcggg aggcccaagt ctcagcccag cacctagaag tgcacctgaa acagaaagag 2280 

cagcactatg aggaaaagat taaagtgttg gacaatcaga taaagaaaga cctggctgac 2340 

aaggagacac tggagaacat gatgcagaga cacgaggagg aggcccatga gaagggcaaa 2400 

attctcagcg aacagaaggc gatgatcaat gctatggatt ccaagatcag atccctggaa 2460 

cagaggattg tggaactgtc tgaagccaat aaacttgcag caaatagcag tctttttacc 2520 

caaaggaaca tgaaggccca agaagagatg atttctgaac tcaggcaaca gaaattttac 2580 

ctggagacac aggctgggaa gttggaggcc cagaaccgaa aactggagga gcagctggag 2640 

aagatcagcc accaagacca cagtgacaag aatcggctgc tggaactgga gacaagattg 2700 

cgggaggtca gtctagagca cgaggagcag aaactggagc tcaagcgcca gctcacagag 2760 

ctacagctct ccctgcagga gcgcgagtca cagttgacag ccctgcaggc tgcacgggcg. 2820 

gccctggaga gccagcttcg ccaggcgaag acagagctgg aagagaccac agcagaagct 2880 

gaagaggaga tccaggcact cacggcacat agagatgaaa tccagcgcaa atttgatgct 2940 

cttcgtaaca gctgtactgt aatcacagac ctggaggagc agctaaacca gctgaccgag 3000 

gacaacgctg aactcaacaa ccaaaacttc tacttgtcca aacaactcga tgaggcttct 3060 

ggcgccaacg acgagattgt acaactgcga agtgaagtgg accatctccg ccgggagatc 3120 

acggaacgag agatgcagct taccagccag aagcaaacga tggaggctct gaagaccacg 3180 

tgcaccatgc tggaggaaca ggtcatggat ttggaggccc taaacgatga gctgctagaa 3240 

aaagagcggc agtgggaggc ctggaggagc gtcctgggtg atgagaaatc ccagtttgag 3300 

tgtcgggttc gagagctgca gaggatgctg gacaccgaga aacagagcag ggcgagagcc 3360 

gatcagcgga tcaccgagtc tcgccaggtg gtggagctgg cagtgaagga gcacaaggct 3420 

gagattctcg ctctgcagca ggctctcaaa gagcagaagc tgaaggccga gagcctctct 3480 

gacaagctca atgacctgga gaagaagcat gctatgcttg aaatgaatgc ccgaagctta 3540 

cagcagaagc tggagactga acgagagctc aaacagaggc ttctggaaga gcaagccaaa 3 600 

ttacagcagc agatggacct gcagaaaaat cacattttcc gtctgactca aggactgcaa 3660 

gaagctctag atcgggctga tctactgaag acagaaagaa gtgacttgga gtatcagctg 3720 

gaaaacattc aggttctcta ttctcatgaa aaggtgaaaa tggaaggcac tatttctcaa 3780 

caaaccaaac tcattgattt tctgcaagcc aaaatggacc aacctgctaa aaagaaaaag 3840 

gttcctctgc agtacaatga gctgaagctg gccctggaga aggagaaagc tcgctgtgca 3900 

gagctagagg aagcccttca gaagacccgc atcgagctcc ggtccgcccg ggaggaagct 3960 

gcccaccgca aagcaacgga ccacccacac ccatccacgc cagccaccgc gaggcagcag 4020 

atcgccatgt ccgccatcgt gcggtcgcca gagcaccagc ccagtgccat gagcctgctg 4080 

gccccgccat ccagccgcag aaaggagtct tcaactccag aggaatttag tcggcgtctt 4140 

aaggaacgca tgcaccacaa tattcctcac cgattcaacg taggactgaa catgcgagcc 4200 

acaaagtgtg ctgtgtgtct ggataccgtg cactttggac gccaggcatc caaatgtctc 4260 

gaatgtcagg tgatgtgtca ccccaagtgc tccacgtgct tgccagccac ctgcggcttg 4320 

cctgctgaat atgccacaca cttcaccgag gccttctgcc gtgacaaaat gaactcccca 4380 

ggtctccaga ccaaggagcc cagcagcagc ttgcacctgg aagggtggat gaaggtgccc 4440 

aggaataaca aacgaggaca gcaaggctgg gacaggaagt acattgtcct ggagggatca 4500 

aaagtcctca tttatgacaa tgaagccaga gaagctggac agaggccggt ggaagaattt 4560 

gagctgtgcc ttcccgacgg ggatgtatct attcatggtg ccgttggtgc ttccgaactc 4620 

gcaaatacag ccaaagcaga tgtcccatac atactgaaga tggaatctca cccgcacacc 4680 

acctgctggc ccgggagaac cctctacttg ctagctccca gcttccctga caaacagcgc 4740 

tgggtcaccg ccttagaatc agttgtcgca ggtgggagag tttctaggga aaaagcagaa 4800 

gctgatgcta aactgcttgg aaactccctg ctgaaactgg aaggtgatga ccgtctagac 4860 

atgaactgca cgctgccctt cagtgaccag gtggtgttgg tgggcaccga ggaagggctc 4920 

tacgccctga r atgtcttgaa aaactcccta acccatgtcc caggaattgg agcagtcttc 4980 

caaatttata ttatcaagga cctggagaag ctactcatga tagcaggaga agagcgggca 5040 

ctgtgtcttg tggacgtgaa gaaagtgaaa cagtccctgg cccagtccca cctgcctgcc 5100 

cagcccgaca tctcacccaa catttttgaa gctgtcaagg gctgccactt gtttggggca 5160 

ggcaagattg agaacgggct ctgcatctgt gcagccatgc ccagcaaagt cgtcattctc 5220 

cgctacaacg aaaacctcag caaatactgc atccggaaag agatagagac ctcagagccc 5280 

tgcagctgta tccacttcac caattacagt atcctcattg gaaccaataa attctacgaa 5340 

atcgacatga agcagtacac gctcgaggaa ttcctggata agaatgacca ttccttggca 5400 
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cctgctgtgt ttgccgcctc ttccaacagc ttccctgtct caatcgtgca ggtgaacagc 5460 

gcagggcagc gagaggagta cttgctgtgt ttccacgaat ttggagtgtt cgtggattct 5520 

tacggaagac gtagccgcac agacgatctc aagtggagtc gcttaccttt ggcctttgcc 5580 

tacagagaac cctatctgtt tgtgacccac ttcaactcac tcgaagtaat tgagatccag 5640 

gcacgctcct cagcagggac ccctgcccga gcgtacctgg acatcccgaa cccgcgctac 5700 

ctgggccctg ccatttcctc aggagcgatt tacttggcgt cctcatacca ggataaatta 5760 

agggtcattt gctgcaaggg aaacctcgtg aaggagtccg gcactgaaca ccaccggggc 5820 

ccgtccacct cccgcagcag ccccaacaag cgaggcccac ccacgtacaa cgagcacatc 5880 

accaagcgcg tggcctccag cccagcgccg cccgaaggcc ccagccaccc gcgagagcca 5940 

agcacacccc accgctaccg cgaggggcgg accgagctgc gcagggacaa gtctcctggc 6000 

cgccccctgg agcgagagaa gtcccccggc cggatactca gcacgcggag agagcggtcc 6060 

cccgcgaggc tgtttgaaga cagcagcagg ggccggctgc ctgcgggagc cgtgaggacc 6120 

ccgctgtccc aggtgaacaa ggtctgggac cagtcttcag tataa 6165 



<210> 2 

<211> 2054 

<212> PRT 

<213> homo sapiens 

<400> 2 

Met Leu Lys Phe Lys Tyr Gly Ala Arg Asn Pro Leu Asp Ala Gly Ala 

1 5 10 15 

Ala Glu Pro He Ala Ser Arg Ala Ser Arg Leu Asn Leu Phe Phe Gin 

20 25 30 

Gly Lys Pro Pro Phe Met Thr Gin Gin Gin Met Ser Pro Leu Ser Arg 

35 40 45 

Glu Gly He Leu Asp Ala Leu Phe Val Leu Phe Glu Glu Cys Ser Gin 

50 • 55 60 

Pro Ala Leu Met Lys He Lys His Val Ser Asn Phe Val Arg Lys Tyr- 
es 70 75 80 
Ser Asp Thr He Ala Glu Leu Gin Glu Leu Gin Pro Ser Ala Lys Asp 

85 90 95 

Phe Glu Val Arg Ser Leu Val Gly Cys Gly His Phe Ala Glu Val Gin 

100 105 110 

Val Val Arg Glu Lys Ala Thr Gly Asp He Tyr Ala Met Lys Val Met 

115 120 125 

Lys Lys Lys Ala Leu Leu Ala Gin Glu Gin Val Ser Phe Phe Glu Glu 

130 135 140 

Glu Arg Asn He Leu Ser Arg Ser Thr Ser Pro Trp He Pro Gin Leu 
145 150 155 160 

Gin Tyr Ala Phe Gin Asp Lys Asn His Leu Tyr Leu Val Met Glu Tyr 

165 170 175 

Gin Pro Gly Gly Asp Leu Leu Ser Leu Leu Asn Arg Tyr Glu Asp Gin 

180 185 190 

Leu Asp Glu Asn Leu He Gin Phe Tyr Leu Ala Glu Leu He Leu Ala 

195 200 205 

Val His Ser Val His Leu Met Gly Tyr Val His Arg Asp He Lys Pro 

210 215 220 

Glu Asn He Leu Val Asp Arg Thr Gly His He Lys Leu Val Asp Phe 
225 230 . 235 240 

Gly Ser Ala Ala Lys Met Asn Ser Asn Lys Met Val Asn Ala Lys Leu 

o 245 250 255 

Pro He Gly Thr Pro Asp Tyr Met Ala Pro Glu Val Leu Thr Val Met 
260 265 270 
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Asn Gly Asp Gly Lys Gly Thr Tyr Gly Leu Asp Cys Asp Trp Trp Ser 

275 280. 285 

Val Gly Val lie Ala Tyr Glu Met lie Tyr Gly Arg Ser Pro Phe Ala 

290 295 300 

Glu Gly Thr Ser Ala Arg. Thr Phe Asn Asn lie Met Asn Phe Gin Arg 
305 310 315 320 

Phe Leu Lys Phe Pro Asp Asp Pro Lys Val Ser Ser Asp Phe Leu Asp 

325 330 335 

Leu He Gin Ser Leu Leu Cys Gly Gin Lys Glu Arg Leu Lys Phe Glu 

340 345 350 

Gly Leu Cys Cys His Pro Phe Phe Ser Lys He Asp Trp Asn Asn He 

355 360 365 

Arg Asn Ser Pro Pro Pro Phe Val Pro Thr Leu Lys Ser Asp Asp Asp 

370 375 380 

Thr Ser Asn Phe Asp Glu Pro Glu Lys Asn Ser Trp Val Ser Ser Ser 
385 390 395 400 

Pro Cys Gin Leu Ser Pro Ser Gly Phe Ser Gly Glu Glu Leu Pro Phe 

405 410 415 

Val Gly Phe Ser Tyr Ser Lys Ala Leu Gly He Leu Gly Arg Ser Glu 

420 ' 425 430 

Ser Val Val Ser Gly Leu Asp Ser Pro Ala Lys Thr Ser Ser Met Glu 

435 440 445 

Lys Lys Leu Leu He Lys Ser Lys Glu Leu Gin Asp Ser Gin Asp Lys 

450 455 460 

Cys His Lys Met Glu Gin Glu Met Thr Arg Leu His Arg Arg Val Ser 
465 470 475 480 

Glu. Val Glu Ala Val Leu Ser Gin Lys Glu Val Glu Leu Lys Ala Ser 

485 490 495 

Glu Thr Gin Arg Ser Leu Leu Glu Gin Asp Leu Ala Thr Tyr He Thr 

500 505 510 

Glu Cys Ser Ser Leu Lys Arg Ser Leu Glu Gin Ala Arg Met Glu Val 

515 520 525 

Ser Gin Glu Asp Asp Lys Ala Leu Gin Leu Leu His Asp He Arg Glu 

530 535 540 

Gin Ser Arg Lys Leu Gin Glu He Lys Glu Gin Glu Tyr Gin Ala Gin 
545 550 555 560 

Val Glu Glu Met Arg Leu Met Met Asn Gin Leu Glu Glu Asp Leu Val 

565 570 575 

Ser Ala Arg Arg Arg Ser Asp Leu Tyr Glu Ser Glu Leu Arg Glu Ser 

580 585 590 

Arg Leu Ala Ala Glu Glu Phe Lys Arg Lys Ala Thr Glu Cys Gin His 

595 600 605 

Lys Leu Leu Lys Ala Lys Asp Gin Gly Lys Pro Glu Val Gly Glu Tyr 

610 615 620 

Ala Lys Leu Glu Lys He Asn Ala Glu Gin Gin Leu Lys He Gin Glu 
625 630 635 640 

Leu Gin Glu Lys Leu Glu Lys Ala Val Lys Ala Ser Thr Glu Ala Thr 

645 650 655 

Glu Leu Leu Gin Asn He Arg Gin Ala Lys Glu Arg Ala Glu Arg Glu 

660 665 670 

Leu Glu Lys Leu Gin Asn Arg Glu Asp Ser Ser Glu Gly He Arg Lys 

675 680 685 

Lys Leu Val Glu Ala Glu Glu Arg Arg His Ser Leu Glu Asn Lys Val 

690 695 700 

Lys Arg Leu Glu Thr Met Glu Arg Arg Glu Asn Arg Leu Lys Asp Asp 
705 710 715 720 
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lie Gin Thr Lys Ser Gin Gin lie Gin Gin Met Ala Asp Lys lie Leu 

725 730 735 

Glu Leu Glu Glu Lys. His Arg Glu Ala Gin Val Ser Ala Gin His Leu 
740 745 750 

• Glu Val His Leu Lys Gin Lys Glu Gin His Tyr Glu Glu Lys lie Lys 
755 760 765 

Val Leu Asp Asn Gin lie Lys Lys Asp Leu Ala Asp Lys Glu Thr Leu 

770 775 780 

Glu Asn Met Met Gin Arg His Glu Glu Glu Ala His Glu Lys Gly Lys 
785 790 ♦ 795 800 

lie Leu Ser Glu Gin Lys Ala Met lie Asn Ala Met Asp Ser Lys lie 

805 810 815 

Arg Ser Leu Glu Gin Arg lie Val Glu Leu Ser Glu Ala Asn Lys Leu 

820 825 830 

Ala Ala Asn Ser Ser Leu Phe Thr Gin Arg Asn Met Lys Ala Gin Glu 

835 840 845 

Glu Met lie Ser Glu Leu Arg Gin Gin Lys Phe Tyr Leu Glu Thr Gin 

850 855 860 

Ala Gly Lys Leu Glu Ala Gin Asn Arg Lys Leu Glu Glu Gin Leu Glu 
865 870 875 880 

Lys He Ser His Gin Asp His Ser Asp Lys Asn Arg Leu Leu Glu Leu 

885 890 895 

Glu Thr Arg Leu Arg Glu Val Ser Leu Glu His Glu Glu Gin Lys Leu 

900 905. 910 

Glu Leu Lys Arg Gin Leu Thr Glu Leu Gin Leu Ser Leu Gin Glu Arg 

915 920 925 

Glu Ser Gin Leu Thr Ala Leu Gin Ala Ala Arg Ala Ala Leu Glu Ser 

930 935 940 

Gin Leu Arg Gin Ala Lys Thr Glu Leu Glu Glu Thr Thr Ala Glu Ala 
945 950 955 960 

Glu Glu Glu He Gin Ala Leu Thr Ala His Arg Asp Glu He Gin Arg 

965 970 975 

Lys Phe Asp Ala Leu Arg Asn Ser Cys Thr Val He Thr Asp Leu Glu 

980 985 990 

Glu Gin Leu Asn Gin Leu Thr Glu Asp Asn Ala Glu Leu Asn Asn Gin 

995 1000 1005 

Asn Phe Tyr Leu Ser Lys Gin Leu Asp Glu Ala Ser Gly Ala Asn Asp 

1010 1015 1020 

Glu He Val Gin Leu Arg "Ser Glu Val . Asp His Leu Arg Arg Glu He 
1025 1030 1035 1040 

Thr Glu Arg Glu Met Gin Leu Thr Ser Gin Lys Gin Thr Met Glu Ala 

1045 1050 1055 

Leu Lys Thr Thr Cys Thr Met Leu Glu Glu Gin Val Met Asp Leu Glu 

1060 1065 1070 

Ala Leu Asn Asp Glu Leu Leu Glu Lys Glu Arg Gin Trp Glu Ala Trp • 

1075 1080 1085 

Arg Ser Val Leu Gly Asp Glu Lys Ser Gin Phe Glu Cys Arg Val Arg 

1090 1095 1100 

Glu Leu Gin Arg Met Leu Asp Thr Glu Lys Gin Ser Arg Ala Arg Ala 
1105 1110 1115 1120 

Asp Gin Arg He Thr Glu Ser Arg Gin Val Val Glu Leu Ala Val Lys 

1125 1130 1135 

Glu His Lys Ala Glu He Leu Ala Leu Gin Gin Ala Leu Lys Glu Gin 

1140 1145 1150 

Lys Leu Lys Ala Glu Ser Leu Ser Asp Lys Leu Asn Asp Leu Glu Lys 
1155 1160 1165 
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Lys His Ala Met Leu Glu Met Asn Ala Arg Ser Leu Gin Gin Lys Leu 

1170 1175 1180 

Glu Thr Glu Arg Glu Leu Lys Gin Arg Leu Leu Glu Glu Gin Ala Lys 
1185 1190 1195 1200 

Leu Gin Gin Gin Met Asp Leu Gin Lys Asn His lie Phe Arg Leu Thr 

1205 1210 1215 

Gin Gly Leu Gin Glu Ala Leu Asp Arg Ala Asp Leu Leu Lys Thr Glu 

1220 1225 1230 

Arg Ser Asp Leu Glu Tyr Gin Leu Glu Asn lie Gin Val Leu Tyr Ser 
1235 1240 1245 

* His Glu Lys Val Lys Met Glu Gly Thr lie Ser Gin Gin Thr Lys Leu 
1250 1255 1260 

lie Asp Phe Leu Gin Ala Lys Met Asp Gin Pro Ala Lys Lys Lys Lys 
1265 1270 1275 1280 

Val Pro Leu Gin Tyr Asn Glu Leu Lys Leu Ala Leu Glu Lys Glu Lys 

1285 1290 1295 

Ala Arg Cys Ala Glu Leu Glu Glu Ala Leu Gin Lys Thr Arg lie Glu 

1300 1305 1310 

Leu Arg Ser Ala Arg Glu Glu Ala Ala His Arg Lys Ala Thr Asp His 

1315 1320 1325 

Pro His Pro Ser Thr Pro Ala Thr Ala Arg Gin Gin lie Ala Met Ser 

1330 1335 1340 

Ala lie Val Arg Ser Pro Glu His Gin Pro Ser Ala Met Ser Leu Leu 
1345 1350 1355 1360 

Ala Pro Pro Ser Ser Arg Arg Lys Glu Ser Ser Thr Pro Glu Glu Phe 

1365 1370 1375 

Ser Arg Arg Leu Lys Glu Arg Met His His Asn He Pro His Arg Phe 

1380 1385 1390 

Asn Val Gly Leu Asn Met Arg Ala Thr Lys Cys Ala Val Cys Leu Asp 

1395 1400 1405 

Thr Val His Phe Gly Arg Gin Ala Ser Lys Cys Leu Glu Cys Gin Val 

1410 1415 1420 

Met Cys His Pro Lys Cys Ser Thr Cys Leu Pro Ala Thr. Cys Gly Leu 
1425 1430 1435 1440 

Pro Ala Glu Tyr Ala Thr His Phe Thr Glu Ala Phe Cys Arg Asp Lys 

1445 1450 1455 

Met Asn Ser Pro Gly Leu Gin Thr Lys Glu Pro Ser Ser Ser Leu His 

1460 1465 1470 

Leu Glu Gly Trp Met Lys Val Pro Arg Asn Asn Lys Arg Gly Gin Gin 

1475 1480 1485 

Gly Trp Asp Arg Lys Tyr He Val Leu Glu Gly Ser Lys Val Leu He 

1490 1495 1500 

Tyr Asp Asn Glu Ala Arg Glu Ala Gly Gin Arg Pro Val Glu Glu Phe 
1505 1510 1515 1520 

Glu Leu Cys Leu Pro Asp Gly Asp Val Ser He His Gly Ala Val Gly 

1525 1530 1535 

Ala Ser Glu Leu Ala Asn Thr Ala Lys Ala Asp Val Pro Tyr He Leu 

1540 1545 1550 

Lys Met Glu Ser His Pro His Thr Thr Cys Trp Pro Gly Arg Thr Leu 

1555 1560 1565 

Tyr Leu Leu Ala Pro Ser Phe Pro Asp Lys Gin Arg Trp Val Thr Ala 

1570 1575 1580 

Leu Glu Ser Val Val Ala Gly Gly Arg Val Ser Arg Glu Lys Ala Glu • 
1585 1590 1595 1600 

Ala Asp Ala Lys Leu Leu Gly Asn Ser Leu Leu Lys Leu Glu Gly Asp 
1605 1610 1615 
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Asp Arg Leu Asp Met Asn Cys Thr Leu Pro Phe Ser Asp Gin Val Val 

1620 1625 1630 

Leu Val Gly Thr Glu Glu Gly Leu Tyr Ala Leu Asn Val Leu Lys Asn 

1635 1640 1645 

Ser Leu Thr His Val Pro Gly He Gly Ala Val Phe Gin He Tyr He 

1650 1655 1660 

He Lys Asp Leu Glu Lys Leu Leu Met He Ala Gly Glu Glu Arg Ala 
1665 1670 1675 1680 

Leu Cys Leu Val Asp Val Lys Lys Val Lys Gin Ser Leu Ala Gin Ser 

1685 1690 1695 

His Leu Pro Ala Gin Pro Asp He Ser Pro Asn He Phe Glu Ala Val 

1700 1705 1710 

Lys Gly Cys His Leu Phe Gly Ala Gly Lys He Glu Asn Gly Leu Cys 

1715 1720 1725 

He Cys Ala Ala Met Pro Ser Lys Val Val He Leu Arg Tyr Asn Glu 

1730 1735 1740 

Asn Leu Ser Lys Tyr Cys He Arg Lys Glu He Glu Thr Ser Glu Pro 
1745 1750 1755 1760 

Cys Ser Cys lie His Phe Thr Asn Tyr Ser He Leu lie Gly Thr Asn 

1765 1770 1775 

Lys Phe Tyr Glu He Asp Met Lys Gin Tyr Thr Leu Glu Glu Phe Leu . 

1780 1785 1790 

Asp Lys Asn Asp His Ser Leu Ala Pro Ala Val Phe Ala Ala Ser Ser 

1795 1800 1805 

Asn Ser Phe Pro Val Ser lie Val Gin Val Asn Ser Ala Gly Gin Arg 

1810 1815 1820 

Glu Glu Tyr Leu Leu Cys Phe His Glu Phe Gly Val Phe Val Asp Ser 
1825 1830 1835 1840 

Tyr Gly Arg Arg Ser Arg Thr Asp Asp Leu Lys Trp Ser Arg Leu Pro 

1845 1850 1855 

Leu Ala Phe Ala Tyr Arg Glu Pro Tyr Leu Phe Val Thr His Phe Asn 

1860 1865 1870 

Ser Leu Glu Val He Glu lie Gin Ala Arg Ser Ser Ala Gly Thr Pro 

1875 1880 1885 

Ala Arg Ala Tyr Leu Asp lie Pro Asn Pro Arg Tyr Leu Gly Pro Ala 

1890 1895 1900 

He Ser Ser Gly Ala He Tyr Leu Ala Ser Ser Tyr Gin Asp Lys Leu 
1905 1910 1915 1920 

Arg Val He Cys Cys Lys Gly Asn Leu Val Lys Glu Ser Gly Thr Glu 

1925 1930 1935 

His His Arg Gly Pro Ser Thr Ser Arg Ser Ser Pro Asn Lys Arg Gly 

1940 1945 1950 

Pro Pro Thr Tyr Asn Glu His lie Thr Lys Arg Val Ala Ser Ser Pro 

1955 1960 1965 

Ala Pro Pro Glu Gly Pro Ser His Pro Arg Glu Pro Ser Thr Pro His 

1970 1975 1980 v 

Arg Tyr Arg Glu Gly Arg Thr Glu Leu Arg Arg Asp Lys Ser Pro Gly 
1985 1990 1995 2000 

Arg Pro Leu Glu Arg Glu Lys Ser Pro Gly Arg lie Leu Ser Thr Arg 

2005 2010 2015 

Arg Glu Arg Ser Pro Ala Arg Leu Phe Glu Asp Ser Ser Arg Gly Arg 

2020 2025 2030 

Leu Pro Ala Gly Ala Val Arg Thr Pro Leu Ser Gin Val Asn Lys Val 

2035 2040 2045 

Trp Asp Gin Ser Ser Val 
2050 
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<210> 3 

<211> 5877 

<212> DNA 

<213> homo sapiens 

<400> 3 

atgttgaagt tcaaatatgg agcgcggaat cctttggatg ctggtgctgc tgaacccatt 60 

gccagccggg cctccaggct gaatctgttc ttccagggga aaccaccctt tatgactcaa 120 

cagcagatgt ctcctctttc ccgagaaggg atattagatg ccctctttgt tctctttgaa 180 

gaatgcagtc agcctgctct gatgaagatt aagcacgtga gcaactttgt ccggaagtat 240 

tccgacacca tagctgagtt acaggagctc cagccttcgg caaaggactt cgaagtcaga 300 

agtcttgtag gttgtggtca ctttgctgaa gtgcaggtgg taagagagaa agcaaccggg 360 

gacatctatg ctatgaaagt gatgaagaag aaggctttat tggcccagga gcaggtttca 420 

ttttttgagg aagagcggaa catattatct cgaagcacaa gcccgtggat cccccaatta 480 

cagtatgcct ttcaggacaa aaatcacctt tatctggtca tggaatatca gcctggaggg 540 

gacttgctgt cacttttgaa tagatatgag gaccagttag atgaaaacct gatacagttt 600 

tacctagctg agctgatttt ggctgttcac agcgttcatc tgatgggata cgtgcatcga 660 

gacatcaagc ctgagaacat tctcgttgac cgcacaggac acatcaagct ggtggatttt 720 

ggatctgccg cgaaaatgaa ttcaaacaag atggtgaatg ccaaactccc gattgggacc 780 

ccagattaca tggctcctga agtgctgact gtgatgaacg gggatggaaa aggcacctac 840 

ggcctggact gtgactggtg gtcagtgggc gtgattgcct atgagatgat ttatgggaga 900 

tcccccttcg cagagggaac ctctgccaga accttcaata acattatgaa tttccagcgg 960 

tttttgaaat ttccagatga ccccaaagtg agcagtgact ttcttgatct gattcaaagc 1020 

ttgttgtgcg gccagaaaga gagactgaag tttgaaggtc tttgctgcca tcctttcttc 1080 

tctaaaattg actggaacaa cattcgtaac tctcctcccc ccttcgttcc caccctcaag 1140 

tctgacgatg acacctccaa ttttgatgaa ccagagaaga attcgtgggt ttcatcctct 1200 

ccgtgccagc tgagcccctc aggcttctcg ggtgaagaac tgccgtttgt ggggttttcg 1260 

tacagcaagg cactggggat tcttggtaga tctgagtctg ttgtgtcggg tctggactcc 1320 

cctgccaaga ctagctccat ggaaaagaaa cttctcatca aaagcaaaga gctacaagac 1380 

tctcaggaca agtgtcacaa gatggagcag gaaatgaccc ggttacatcg gagagtgtca 1440 

gaggtggagg ctgtgcttag tcagaaggag gtggagctga aggcctctga gactcagaga 1500 

tccctcctgg agcaggacct tgctacctac atcacagaat gcagtagctt aaagcgaagt 1560 

ttggagcaag cacggatgga ggtgtcccag gaggatgaca aagcactgca gcttctccat 1620 

gatatcagag agcagagccg gaagctccaa gaaatcaaag agcaggagta ccaggctcaa 1680 

gtggaagaaa tgaggttgat gatgaatcag ttggaagagg atcttgtctc agcaagaaga 1740 

cggagtgatc tctacgaatc tgagctgaga gagtctcggc ttgctgctga agaattcaag 1800 

cggaaagcga cagaatgtca gcataaactg ttgaaggcta aggatcaagg gaagcctgaa 1860 

gtgggagaat atgcgaaact ggagaagatc aatgctgagc agcagctcaa aattcaggag 1920 

ctccaagaga aactggagaa ggctgtaaaa gccagcacgg aggccaccga gctgctgcag 1980 

aatatccgcc aggcaaagga gcgagccgag agggagctgg agaagctgca gaaccgagag 2040 

gattcttctg aaggcatcag aaagaagctg gtggaagctg aggaacgccg ccattctctg 2100 

gagaacaagg taaagagact agagaccatg gagcgtagag aaaacagact. gaaggatgac 2160 A 

atccagacaa aatcccaaca gatccagcag atggctgata aaattctgga gctcgaagag 2220 

aaacatcggg aggcccaagt ctcagcccag cacctagaag tgcacctgaa acagaaagag 2280 

cagcactatg aggaaaagat taaagtgttg gacaatcaga taaagaaaga cctggctgac 2340 

aaggagacac tggagaacat gatgcagaga cacgaggagg aggcccatga gaagggcaaa 2400 

attctcagcg aacagaaggc gatgatcaat gctatggatt ccaagatcag atccctggaa 2460 

cagaggattg tggaactgtc tgaagccaat aaacttgcag caaatagcag tctttttacc 2520 

caaaggaaca tgaaggccca agaagagatg atttctgaac tcaggcaaca gaaattttac 2580 

ctggagacac aggctgggaa gttggaggcc cagaaccgaa aactggagga gcagctggag 2640 

aagatcagcc accaagacca cagtgacaag aatcggctgc tggaactgga gacaagattg 2700 

cgggaggtca gtctagagca cgaggagcag aaactggagc tcaagcgcca gctcacagag 2760 

ctacagctct ccctgcagga gcgcgagtca cagttgacag ccctgcaggc tgcacgggcg 2820 

gccctggaga gccagcttcg ccaggcgaag acagagctgg aagagaccac agcagaagct 2880 

gaagaggaga tccaggcact cacggcacat agagatgaaa tccagcgcaa atttgatgct 2940 

cttcgtaaca gctgtactgt aatcacagac ctggaggagc agctaaacca gctgaccgag 3000 
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gacaacgctg aactcaacaa ccaaaacttc tacttgtcca aacaactcga tgaggcttct 3060 

ggcgccaacg acgagattgt acaactgcga agtgaagtgg accatctccg ccgggagatc 3120 

acggaacgag agatgcagct taccagccag aagcaaacga tggaggctct gaagaccacg 3180 

tgcaccatgc tggaggaaca ggtcatggat ttggaggccc taaacgatga gctgctagaa 3240 

aaagagcggc agtgggaggc ctggaggagc gtcctgggtg atgagaaatc ccagtttgag 3300 

tgtcgggttc gagagctgca gaggatgctg gacaccgaga aacagagcag ggcgagagcc 3360 

gatcagcgga tcaccgagtc tcgccaggtg gtggagctgg cagtgaagga gcacaaggct 3420 

gagattctcg ctctgcagca ggctctcaaa gagcagaagc tgaaggccga gagcctctct 3480 

gacaagctca atgacctgga gaagaagcat gctatgcttg aaatgaatgc ccgaagctta 3540 

cagcagaagc tggagactga acgagagctc aaacagaggc ttctggaaga gcaagccaaa 3600 

ttacagcagc agatggacct gcagaaaaat cacattttcc gtctgactca aggactgcaa 3 660 

gaagctctag atcgggctga tctactgaag acagaaagaa gtgacttgga gtatcagctg 3720 

gaaaacattc aggttctcta ttctcatgaa aaggtgaaaa tggaaggcac tatttctcaa 3780 

caaaccaaac tcattgattt tctgcaagcc aaaatggacc aacctgctaa aaagaaaaag 3840 

gttcctctgc agtacaatga gctgaagctg gccctggaga aggagaaagc tcgctgtgca 3900 

gagctagagg aagcccttca gaagacccgc atcgagctcc ggtccgcccg ggaggaagct 3960 

gcccaccgca aagcaacgga ccacccacac ccatccacgc cagccaccgc gaggcagcag 4020 

atcgccatgt ccgccatcgt gcggtcgcca gagcaccagc ccagtgccat gagcctgctg 4080 

gccccgccat ccagccgcag aaaggagtct tcaactccag aggaatttag tcggcgtctt 4140 

aaggaacgca tgcaccacaa tattcctcac cgattcaacg taggactgaa catgcgagcc 4200 

acaaagtgtg ctgtgtgtct ggataccgtg cactttggac gccaggcatc caaatgtctc 4260 

gaatgtcagg tgatgtgtca ccccaagtgc tccacgtgct tgccagccac ctgcggcttg 4320 

cctgctgaat atgccacaca cttcaccgag gccttctgcc gtgacaaaat gaactcccca 4380 

ggtctccaga ccaaggagcc cagcagcagc ttgcacctgg aagggtggat gaaggtgccc 4440 

aggaataaca aacgaggaca gcaaggctgg gacaggaagt acattgtcct ggagggatca 4500 

aaagtcctca tttatgacaa tgaagccaga gaagctggac agaggccggt ggaagaattt 4560 

gagctgtgcc ttcccgacgg ggatgtatct attcatggtg ccgttggtgc ttccgaactc 4620 

gcaaatacag ccaaagcaga tgtcccatac atactgaaga tggaatctca cccgcacacc 4680 

acctgctggc ccgggagaac cctctacttg ctagctccca gcttccctga caaacagcgc 4740 

tgggtcaccg ccttagaatc agttgtcgca ggtgggagag tttctaggga aaaagcagaa 4800 

gctgatgcta aactgcttgg aaactccctg ctgaaactgg aaggtgatga ccgtctagac 4860 

atgaactgca cgctgccctt cagtgaccag gtggtgttgg tgggcaccga ggaagggctc 4920 

tacgccctga atgtcttgaa aaactcccta acccatgtcc caggaattgg agcagtcttc 4980 

caaatttata ttatcaagga cctggagaag ctactcatga tagcaggaga agagcgggca 5040 

ctgtgtcttg tggacgtgaa gaaagtgaaa cagtccctgg cccagtccca cctgcctgcc 5100 

cagcccgaca tctcacccaa catttttgaa gctgtcaagg gctgccactt gtttggggca 5160 

ggcaagattg agaacgggct ctgcatctgt gcagccatgc ccagcaaagt cgtcattctc 5220 

cgctacaacg aaaacctcag caaatactgc atccggaaag agatagagac ctcagagccc 5280 

tgcagctgta tccacttcac caattacagt atcctcattg gaaccaataa attctacgaa 5340 

atcgacatga agcagtacac gctcgaggaa ttcctggata agaatgacca ttccttggca ' 5400 

cctgctgtgt ttgccgcctc ttccaacagc ttccctgtct caatcgtgca ggtgaacagc 5460 

gcagggcagc gagaggagta cttgctgtgt ttccacgaat ttggagtgtt cgtggattct 5520 

tacggaagac gtagccgcac agacgatctc aagtggagtc gcttaccttt ggcctttgcc . 5580 

tacagagaac cctatctgtt tgtgacccac ttcaactcac tcgaagtaat tgagatccag 5640 

gcacgctcct cagcagggac ccctgcccga gcgtacctgg acatcccgaa cccgcgctac 5700 

ctgggccctg ccatttcctc aggagcgatt tacttggcgt cctcatacca ggataaatta 5760 

agggtcattt gctgcaaggg aaacctcgtg aaggagtccg gcactgaaca ccaccggggc 5820 

ccgtccacct cccgcagatt tcaaagccat atggctagag atgaatataa accttga 5877 

<210> 4 
<211> 1958 
<212> PRT 

<213> homo sapiens 
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<400> 4 

Met Leu Lys Phe Lys Tyr Gly Ala Arg Asn Pro Leu Asp Ala Gly Ala 

15 10 15 

Ala Glu Pro lie Ala Ser Arg Ala Ser Arg Leu Asn Leu Phe Phe Gin 

20 25 30 

Gly Lys Pro Pro Phe Met Thr Gin Gin Gin Met Ser Pro Leu Ser Arg 

35 40 45 

Glu Gly He Leu Asp Ala Leu Phe Val Leu Phe Glu Glu Cys Ser Gin 

50 55 60 

Pro Ala Leu Met Lys He Lys His Val Ser Asn Phe Val Arg Lys Tyr 
65 70 75 80 

Ser Asp Thr He Ala Glu Leu Gin Glu Leu Gin Pro Ser Ala Lys Asp 

85 90 95 

Phe Glu Val Arg Ser Leu Val Gly Cys Gly His Phe Ala Glu Val Gin 

100 105 110 

Val Val Arg Glu Lys Ala Thr Gly Asp He Tyr Ala Met Lys Val Met 

115 120 125 

Lys Lys Lys Ala Leu Leu Ala Gin Glu Gin Val Ser Phe Phe Glu Glu 

130 135 140 

Glu Arg Asn He Leu Ser Arg Ser Thr Ser Pro Trp He Pro Gin Leu 
145 150 155 160 

Gin Tyr Ala Phe Gin Asp Lys Asn His Leu Tyr Leu Val Met Glu Tyr 

165 170 175 

Gin Pro Gly Gly Asp Leu Leu Ser Leu Leu Asn Arg Tyr Glu Asp Gin 

180 185 190 

Leu Asp Glu Asn Leu He Gin Phe Tyr Leu Ala Glu Leu He Leu Ala 

195 200 205 

Val His Ser Val His Leu Met Gly Tyr Val His Arg Asp He Lys Pro 

210 215 " 220 

Glu Asn He Leu Val Asp Arg Thr Gly His He Lys Leu Val Asp Phe 
225 230 235 240 

Gly Ser Ala Ala Lys Met Asn Ser Asn Lys Met Val Asn Ala Lys Leu 

245 250 255 

Pro He Gly Thr Pro Asp Tyr Met Ala Pro Glu Val Leu Thr Val Met 

260 265 270 

Asn Gly Asp Gly Lys Gly Thr Tyr Gly Leu Asp Cys Asp Trp Trp Ser 

275 280 285 

Val Gly Val He Ala Tyr Glu Met He Tyr Gly Arg Ser Pro Phe Ala 

290 295 300 

Glu Gly Thr Ser Ala Arg Thr Phe Asn Asn He Met Asn Phe Gin Arg 
305 310 315 320 

Phe Leu Lys Phe Pro Asp Asp Pro Lys Val Ser Ser Asp Phe Leu Asp 

325 330 335 

Leu He Gin Ser Leu Leu Cys Gly Gin Lys Glu Arg Leu Lys Phe Glu 

340 345 350 

Gly Leu Cys Cys His Pro Phe Phe Ser Lys He Asp Trp Asn Asn lie 

355 . 360 365 

Arg Asn Ser Pro Pro Pro Phe Val Pro Thr Leu Lys Ser Asp Asp Asp 

370 375 380 

Thr Ser Asn Phe Asp Glu Pro Glu Lys Asn Ser Trp Val . Ser Ser Ser 
385 390 395 400 

Pro Cys Gin Leu Ser Pro Ser Gly Phe Ser Gly Glu Glu Leu Pro Phe 

405 410 415 

Val Gly Phe Ser Tyr Ser Lys Ala Leu Gly lie Leu Gly Arg Ser Glu 

420 425 430 

Ser Val Val Ser Gly Leu Asp Ser Pro Ala Lys Thr Ser Ser Met Glu 
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435 440 445 

Lys Lys Leu Leu lie Lys Ser Lys Glu Leu Gin Asp Ser Gin Asp Lys 

450 455 460 

Cys His Lys Met Glu Gin Glu Met Thr Arg Leu His Arg Arg Val Ser 
465 470 475 480 

Glu Val Glu Ala Val Leu Ser Gin Lys Glu Val Glu Leu Lys Ala Ser 

485 490 495 

Glu Thr Gin Arg Ser Leu Leu Glu Gin Asp Leu Ala Thr Tyr lie Thr 

500 505 510 

Glu Cys Ser Ser Leu Lys Arg Ser Leu Glu Gin Ala Arg Met Glu Val 

515 520 525 

Ser Gin Glu Asp Asp Lys Ala Leu Gin Leu Leu His Asp lie Arg Glu 

530 535 540 

Gin Ser Arg Lys Leu Gin Glu lie Lys Glu Gin Glu Tyr Gin Ala Gin 
545 550 555 560 

Val Glu Glu Met Arg Leu Met Met Asn Gin Leu Glu Glu Asp Leu Val 

565 570 575 

Ser Ala Arg Arg Arg Ser Asp Leu Tyr Glu Ser Glu Leu Arg Glu Ser 

580 585 590 

Arg Leu Ala Ala Glu Glu Phe Lys Arg Lys Ala Thr Glu Cys Gin His 

595 600 605 

Lys Leu Leu Lys Ala Lys Asp Gin Gly Lys Pro Glu Val Gly Glu Tyr 

610 615 620 

Ala Lys Leu Glu Lys lie Asn Ala Glu Gin Gin Leu Lys lie Gin Glu 
625 630 635 640 

Leu Gin Glu Lys Leu Glu Lys Ala Val Lys Ala Ser Thr Glu Ala Thr 

645 650 655 

Glu Leu Leu Gin Asn lie Arg Gin Ala Lys Glu Arg Ala Glu Arg Glu 

660 665 670 

Leu Glu Lys Leu Gin Asn Arg Glu Asp Ser Ser Glu Gly lie Arg Lys 

675 680 685 

Lys Leu Val Glu Ala Glu Glu Arg Arg His Ser Leu Glu Asn Lys Val 

690 695 700 

Lys Arg Leu Glu Thr Met Glu Arg Arg Glu Asn Arg Leu Lys Asp Asp 
705 710 715 720 

lie Gin Thr Lys Ser Gin Gin lie Gin Gin Met Ala Asp Lys lie Leu 

725 730 735 

Glu Leu Glu Glu Lys His Arg Glu Ala Gin Val Ser Ala Gin His Leu 

740 745 750 

Glu Val His Leu Lys Gin Lys Glu Gin His Tyr Glu Glu Lys lie Lys 

755 760 765 

Val Leu Asp Asn Gin lie Lys Lys Asp Leu Ala Asp Lys Glu Thr Leu 

770 775 780 

Glu Asn Met Met Gin Arg His Glu Glu Glu Ala His Glu Lys Gly Lys 
785 790 795 800 

He Leu Ser Glu Gin Lys Ala Met He Asn Ala Met Asp Ser Lys He 

805 810 815 

Arg Ser Leu Glu Gin Arg He Val Glu Leu Ser Glu Ala Asn Lys Leu 

820 825 830 

Ala Ala Asn Ser Ser Leu Phe Thr Gin Arg Asn Met Lys Ala Gin Glu 

835 840 845 

Glu Met He Ser Glu Leu Arg Gin Gin Lys Phe Tyr Leu Glu Thr Gin 

850 855 860 

Ala Gly Lys Leu Glu Ala Gin Asn Arg Lys Leu Glu Glu Gin Leu Glu 
865 870 875 880 

Lys He Ser His Gin Asp His Ser Asp Lys Asn Arg Leu Leu Glu Leu 
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885 


Oft A 

890 








895 


Glu Thr. Arg 


Leu 


Arg Glu Val 


Ser Leu Glu 


His 


Glu 


Glu Gin 


Lys Leu 




900 




905 






910 




Glu Leu Lys 


Arg 


Gin Leu Thr 


Glu Leu Gin 


Leu 


Ser 


Leu Gin 


Glu Arg 


915 






920 






925 




Glu Ser Gin 


Leu 


Thr Ala Leu 


Gin Ala Ala 


Arg 


Ala 


Ala Leu 


Glu Ser 


930 




935 






940 






Gin Leu Arg 


Gin 


Ala Lys Thr 


Glu Leu Glu 


Glu 


Thr 


Thr Ala 


Glu Ala 


945 




950 




955 






960 


Glu Glu Glu 


lie 


Gin Ala Leu 


Thr Ala His 


Arg 


Asp 


Glu He 


Gin Arg 






965 


970 








975 


Lys Phe Asp 


Ala 


Leu Arg Asn 


Ser Cys Thr 


Val 


He 


Thr Asp 


Leu Glu 




980 




985 






990 




Glu Gin Leu 


Asn 


Gin Leu Thr 


Glu Asp Asn 


Ala 


Glu 


Leu Asn 


Asn Gin 


995 






1000 






1005 




Asn Phe Tyr 


Leu 


Ser Lys Gin 


Leu Asp Glu 


Ala 


Ser 


Gly Ala 


Asn Asp 


1010 




1015 




1020 




Glu lie Val 


Gin 


Leu Arg Ser 


Glu Val Asp 


His 


Leu Arg Arg 


Glu He 


1025 




1030 




1035 




1040 


Thr Glu Arg 


Glu 


Met Gin Leu 


Thr Ser Gin 


Lys 


Gin 


Thr Met 


Glu Ala 






1045 


1050 






1055 


Leu Lys Thr 


Thr 


Cys Thr Met 


Leu Glu Glu 


Gin 


Val 


Met Asp 


Leu Glu 




1060 


1065 






1070 


Ala Leu Asn 


Asp Glu Leu Leu Glu Lys Glu Arg Gin Trp Glu Ala Trp 


1075 




1080 






1085 




Arg Ser Val 


Leu Gly Asp Glu 


Lys Ser Gin 


Phe 


Glu 


Cys Arg Val Arg 


1090 




1095 




1100 




Glu Leu Gin Arg Met Leu Asp Thr Glu Lys Gin Ser Arg Ala Arg Ala 


1105 




1110 




1115 




1120 


Asp Gin Arg 


lie 


Thr Glu Ser 


Arg Gin Val 


Val 


Glu 


Leu Ala 


Val Lys 






1125 


1130 






1135 


Glu His Lys 


Ala 


Glu lie Leu 


Ala Leu Gin 


Gin 


Ala 


Leu Lys 


Glu Gin 



1140 1145 1150 



Lys Leu Lys Ala Glu Ser Leu Ser Asp Lys Leu Asn Asp Leu Glu Lys 

1155 1160 1165 

Lys His Ala Met Leu Glu Met Asn Ala Arg Ser Leu Gin Gin Lys Leu 

1170 1175 1180 

Glu Thr Glu Arg Glu Leu Lys Gin Arg Leu Leu Glu Glu Gin Ala Lys 
1185 1190 1195 1200 

Leu Gin Gin Gin Met Asp Leu Gin Lys Asn His lie Phe Arg Leu Thr 

1205 1210 1215 

Gin Gly Leu Gin Glu Ala Leu Asp Arg Ala Asp Leu Leu Lys Thr Glu 

1220 1225 1230 

Arg Ser Asp Leu Glu Tyr Gin Leu Glu Asn He Gin Val Leu Tyr Ser 

1235 1240 1245 

His Glu Lys Val Lys Met Glu Gly Thr He Ser Gin Gin Thr Lys Leu 

1250 1255 1260 ■ 

He Asp Phe Leu Gin Ala Lys Met Asp. Gin Pro Ala Lys Lys Lys Lys 
1265 1270 1275 1280 

Val Pro Leu Gin Tyr Asn Glu Leu Lys Leu Ala Leu Glu Lys Glu Lys 

1285 1290 1295 

Ala Arg Cys Ala Glu Leu Glu Glu Ala Leu Gin Lys Thr Arg He Glu 

1300 1305 1310 

Leu Arg Ser Ala Arg Glu Glu Ala Ala His Arg Lys Ala Thr Asp His 

1315 1320 1325 

Pro His Pro Ser Thr. Pro Ala Thr Ala Arg Gin Gin He Ala Met Ser 
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1330 1335 1340 

Ala lie Val Arg Ser Pro Glu His Gin Pro Ser Ala Met Ser Leu Leu 
1345 1350 1355 1360 

Ala Pro Pro Ser Ser Arg Arg Lys Glu Ser Ser Thr Pro Glu Glu Phe 

1365 1370 1375 

Ser Arg Arg Leu Lys Glu Arg Met His His Asn lie Pro His Arg Phe 

1380 1385 1390 

Asn Val Gly Leu Asn Met Arg Ala Thr Lys Cys Ala Val Cys Leu Asp . 

1395 1400 1405 

Thr Val His Phe Gly Arg Gin Ala Ser Lys Cys Leu Glu Cys Gin Val 

1410 1415 1420 

Met Cys His Pro Lys Cys Ser Thr Cys Leu Pro Ala Thr Cys Gly Leu 
1425 1430 1435 1440 

Pro Ala Glu Tyr Ala Thr His Phe Thr Glu Ala Phe Cys Arg Asp Lys 

1445 1450 1455 

Met Asn Ser Pro Gly Leu Gin Thr Lys Glu Pro Ser Ser Ser Leu His 

1460 1465 1470 

Leu Glu Gly Trp Met Lys Val Pro Arg Asn Asn Lys Arg Gly Gin Gin 

1475 1480 1485 

Gly Trp Asp Arg Lys Tyr lie Val Leu Glu Gly Ser Lys Val Leu lie 

1490 1495 1500 

Tyr Asp Asn Glu Ala Arg Glu Ala Gly Gin Arg Pro Val Glu Glu Phe 
1505 1510 1515 1520 

Glu Leu Cys Leu Pro Asp Gly Asp Val Ser lie His Gly Ala Val Gly 

1525 1530 1535 

Ala Ser Glu Leu Ala Asn Thr Ala Lys Ala Asp Val Pro Tyr lie Leu 

1540 1545 1550 

Lys Met Glu Ser His Pro His Thr Thr Cys Trp Pro Gly Arg Thr Leu 

1555 1560 1565 

Tyr Leu Leu Ala Pro Ser Phe Pro Asp Lys Gin Arg Trp Val Thr Ala 

1570 1575 1580 

Leu Glu Ser Val Val Ala Gly Gly Arg Val Ser Arg Glu Lys Ala Glu 
1585 ' 1590 1595 1600 

Ala Asp Ala Lys Leu Leu Gly Asn Ser Leu Leu Lys Leu Glu Gly Asp 

1605 1610 1615 

Asp Arg Leu Asp Met Asn Cys Thr Leu Pro Phe Ser Asp Gin Val Val 

1620 1625 1630 

Leu Val Gly Thr Glu Glu Gly Leu Tyr Ala Leu Asn Val Leu Lys Asn 

1635 1640 1645 

Ser Leu Thr His Val Pro Gly He Gly Ala Val Phe Gin He Tyr He 

1650 1655 1660 

He Lys Asp Leu Glu Lys Leu Leu Met He Ala Gly Glu Glu Arg Ala 
1665 1670 1675 1680 

Leu Cys Leu Val Asp Val Lys Lys Val Lys Gin Ser Leu Ala Gin Ser 

1685 1690 1695 

His Leu Pro Ala Gin Pro Asp He Ser Pro Asn He Phe Glu Ala Val 

1700 1705 1710 

Lys Gly Cys His Leu Phe Gly Ala Gly Lys He Glu Asn Gly Leu Cys 

1715 1720 1725 

He Cys Ala Ala Met Pro Ser Lys Val Val lie Leu Arg Tyr Asn Glu 

1730 1735 1740 

Asn Leu Ser Lys Tyr Cys He Arg Lys Glu He Glu Thr Ser Glu Pro 
1745 1750 1755 1760 

Cys Ser Cys lie His Phe Thr Asn Tyr Ser He Leu lie Gly Thr Asn 

1765 1770 1775 

Lys Phe Tyr Glu He Asp Met Lys Gin Tyr Thr Leu Glu Glu Phe Leu 
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1780 1785 1790 

Asp Lys Asn Asp His Ser Leu Ala Pro Ala Val Phe -Ala Ala Ser Ser 

1795 1800 1805 

Asn Ser Phe Pro Val Ser He Val Gin Val Asn Ser Ala Gly Gin Arg 

1810 1815 1820 

Glu Glu Tyr Leu Leu Cys Phe His Glu Phe Gly Val Phe Val Asp Ser 
1825 1830 1835 1840 

Tyr Gly Arg Arg Ser Arg Thr Asp Asp Leu Lys Trp Ser Arg Leu Pro 

1845 1850 1855 

Leu Ala Phe Ala Tyr Arg Glu Pro Tyr Leu Phe Val Thr His Phe Asn 

1860 1865 1870 

Ser Leu Glu Val He Glu He Gin Ala Arg Ser Ser Ala Gly Thr Pro 

. 1875 1880 1885 

Ala Arg Ala Tyr Leu Asp He Pro Asn Pro Arg Tyr Leu Gly Pro Ala 

1890 1895 1900 

He Ser Ser Gly Ala He Tyr Leu Ala Ser Ser Tyr Gin Asp Lys Leu 
1905 1910 1915 1920 

Arg Val He Cys Cys Lys Gly Asn Leu Val Lys Glu Ser Gly Thr Glu 

1925 1930 1935 

His His Arg Gly Pro Ser Thr Ser Arg Arg Phe Gin Ser His Met Ala 

1940 1945 1950 

Arg Asp Glu Tyr Lys Pro 
1955 
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